Introduction


The last twenty years have brought a revolution in the use of computers
in social science. To be sure, the dominant form of computer use remains
the estimation of statistical models for data. A steady supply of new models
has arrived, from the durational models of the 1980s to the logistic
regressions of the 1990s and the count-data regressions of the most recent
decade. Meanwhile the algorithms used for estimation and optimization
have changed considerably, as increases in computational power enabled
a shift toward maximum likelihood and a simulation approach to data.
Jackknives and bootstraps were rapidly replaced by the more
computationally demanding expectation maximization (EM) and Gibbs sampling
algorithms, which in turn enabled the estimation of parameters for the
Bayesian functional forms whose empirical plausibility had long been
unfulfilled because of their computational intractability.

Side by side with this deeper application of computers in more or less
traditional forms of data analysis has come the rapid expansion of quite
new approaches to data. To some extent these new approaches were driven
by the sheer size of data sets. Along with speed of execution has come
an almost unthinkable increase in storage capacity, with the result that
it has become possible to exploit population (rather than sample) data
collected by automated reporting (the stock market, credit records,
hospital utilization patterns, etc.) in continuous real time (rather than
discrete time or at-a-point). Biologists and computer scientists have led an
assault on the problems of pattern recognition in such data, and the broader
class of techniques known as data mining has begun to trickle into the social
sciences in the guise of optimal matching techniques and the like.

But the most important changes in social science computation have
come in the use of computers to “think through” the implications of human
actions within given social structures: action in networks. Such
“agent-based modeling” has been applied to everything from the diffusion of
norms and innovations to voting. Some scholars have studied the ways
that local networks clump into larger social structures, again using the
immense power of computers to investigate random networks with specific
properties. Others have used computer power to visualize the evolution
of social structures, rendering huge amounts of data into visual patterns
easily detected and differentiated by sight.

In this issue we have brought together ten papers in this vibrant new
area, in the hope that readers can see for themselves what the new
computational approaches have to offer sociology. Running the gamut from
the theoretical to the empirical, tackling topics from the purely theoretical
question of fanaticism in normative networks to voting, water usage, and
the development of organizational fields, these articles demonstrate the
breadth of computational methods. We are also pleased to publish here
the spectacular images of Powell, Macy, Koput, and Owen-Smith and of
Moody, McFarland, and Bender-deMoll; both articles provide innovative
visualizations (the latter in movie format) for complex social systems.

We begin with a theoretical paper showing the homology between
agent-based modeling (ABM) and some existing traditions in sociological
theory. Cederman makes a direct link between a classical theoretical
tradition (Simmelian process theory) and ABM. He argues that simulation
is a direct expression of theoretical reasoning and draws on a wide variety
of examples to show the ways ABM can account for emergent patterns.
In particular, the paper reviews a wide variety of work providing ABM
accounts of emergence in interaction patterns, in configurations of
properties, in networks, and even in actors themselves.

With Robins, Pattison, and Woolcock, we move into a set of four papers
(Robins et al., Chang and Harrington, Eguíluz et al., and Centola et al.)
investigating abstract questions about the structural and evolutionary
patterns of social systems of actors. The first of these directly studies the
merging of small networks into large ones, and the latter three use ABM
methods to investigate the evolution of networks and, in two cases, the
ideas or knowledge that pass through them.

Robins et al. extend the themes of Cederman’s paper by showing that,
while local structures can be relatively easily derived from simple local
processes, the global patterns that emerge from these local structures can
be much harder to predict or account for. However, they stand within a
purely structural tradition. Without theorizing the origins of the particular
local properties of graphs (why actors might generate certain kinds of
connections), they work directly on the problem of the global properties
of graphs with particular local patterns. Local graphs with specified
patterns (of single edges, two-stars, three-stars, and triangles) are generated
by modification of a random graph on 100 nodes, using the Metropolis
algorithm for optimization that is the grandfather of all Monte Carlo
optimization techniques. Robins et al. then consider the global properties
of the resultant graphs under a variety of initial specified patterns. They
begin therefore to classify the ways certain kinds of global structures do
or do not emerge from various local ones.
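
To make the procedure concrete, the following is a minimal sketch of a
Metropolis step on a graph. It is not the specification used by Robins et al.:
it weights only edges and triangles (omitting two-stars and three-stars), and
the function names and the parameters theta_edge and theta_triangle are
assumptions introduced purely for illustration.

```python
import math
import random


def metropolis_graph(n, theta_edge, theta_triangle, steps, seed=0):
    """Toggle one randomly chosen dyad per step, accepting by the Metropolis rule."""
    rng = random.Random(seed)
    adj = [set() for _ in range(n)]  # adjacency sets of an undirected graph
    for _ in range(steps):
        i, j = rng.sample(range(n), 2)
        shared = len(adj[i] & adj[j])    # triangles that the edge (i, j) closes
        sign = -1 if j in adj[i] else 1  # -1: remove the edge, +1: add it
        # Change in the (hypothetical) weighted sum of edge and triangle counts.
        delta = sign * (theta_edge + theta_triangle * shared)
        # Accept the toggle with probability min(1, exp(delta)).
        if delta >= 0 or rng.random() < math.exp(delta):
            if sign == 1:
                adj[i].add(j)
                adj[j].add(i)
            else:
                adj[i].discard(j)
                adj[j].discard(i)
    return adj


def count_edges(adj):
    return sum(len(nb) for nb in adj) // 2


def count_triangles(adj):
    # Each triangle is found once per edge, hence the division by three.
    return sum(len(adj[i] & adj[j])
               for i in range(len(adj)) for j in adj[i] if i < j) // 3
```

Running such a sampler under different local weights and then comparing
global summaries of the resulting graphs (here only edge and triangle counts)
mirrors, in a very reduced form, the local-to-global comparison described above.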

Chang and Harrington undertake a more mainstream study in the ABM
tradition, although they too end up with important findings about global
results. The authors aim to understand how knowledge can be generated
and diffused within a network in which individuals can either innovate
or imitate. Individuals have a “history” that determines their willingness
to learn and a choice of from whom they learn. Thus, the network of
learning is itself endogenous to the process, as are innovation and
imitation. As Chang and Harrington point out, however, while at the
individual level innovation and imitation are substitutes, at the global level
they are complements, since innovation provides a pool of ideas that
imitation can then spread. Oddly, reliability of communication has the
effect of reducing overall performance by generating excess uniformity.
So again, this is a paper that underscores the often counterintuitive nature
of the move from local to global in properties of networks of agents.

Eguíluz, Zimmermann, Cela-Conde, and San Miguel also examine a
network evolution problem, but they are interested not in the spread of
ideas through the system, but simply in the evolution of cooperation itself.
Theirs is a much more straightforward model, in which individuals play
a prisoner’s dilemma game with their network neighbors, update their
strategy by imitating the neighbor who gets the largest payoff, and then,
if they get hurt (because their neighbor was a defector), give up on the
defector and choose some random new neighbor. The surprising result is
that strongly hierarchical structures emerge, focusing on strong cooperators,
but that these are subject to cascading disturbances if leading cooperators
suddenly change their behavior.
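
The three steps just described (play, imitate, rewire) can be sketched as
follows. This is a simplified illustration rather than the authors’ model:
the payoff values (1 for mutual cooperation, b for defecting against a
cooperator, 0 otherwise), the rewiring probability p_rewire, the initial share
of cooperators, and all names are assumptions made for the example.

```python
import random


def simulate(n=50, k=4, b=1.5, p_rewire=0.5, rounds=30, seed=1):
    """Return (strategies, neighbor sets) after `rounds` synchronous updates."""
    rng = random.Random(seed)
    coop = [rng.random() < 0.6 for _ in range(n)]  # True = cooperator
    nbrs = [set() for _ in range(n)]
    edges = 0
    while edges < n * k // 2:  # random starting network with mean degree k
        i, j = rng.sample(range(n), 2)
        if j not in nbrs[i]:
            nbrs[i].add(j)
            nbrs[j].add(i)
            edges += 1
    for _ in range(rounds):
        # 1. Each agent plays the game with every current neighbor.
        pay = [0.0] * n
        for i in range(n):
            for j in nbrs[i]:
                if coop[j]:  # only a cooperating partner yields a payoff
                    pay[i] += 1.0 if coop[i] else b
        # 2. Each agent finds the top earner among itself and its neighbors
        #    (ties are resolved in favor of the agent itself).
        best = [max(nbrs[i] | {i}, key=lambda a: (pay[a], a == i))
                for i in range(n)]
        # 3. An agent whose model is a defector may drop that tie and
        #    attach to a randomly chosen new partner.
        for i in range(n):
            d = best[i]
            if d != i and not coop[d] and d in nbrs[i] and rng.random() < p_rewire:
                options = [a for a in range(n)
                           if a != i and a != d and a not in nbrs[i]]
                if options:
                    new = rng.choice(options)
                    nbrs[i].discard(d)
                    nbrs[d].discard(i)
                    nbrs[i].add(new)
                    nbrs[new].add(i)
        # Strategies are copied from the model chosen in step 2.
        coop = [coop[best[i]] for i in range(n)]
    return coop, nbrs
```

Because every dropped tie is replaced by exactly one new tie, the number of
links stays fixed while the shape of the network is free to change, which is
what allows structure to emerge from the rewiring rule alone.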

Centola, Willer, and Macy also examine the evolution of a norm in a
structure, but they focus on the particular case of a norm that
is “supported by a few fanatics and opposed by the vast majority.” After
pointing out the wide variety of such norms in social experience, Centola
et al. create an ABM for the situation that models belief, willingness to
be influenced pro or con, and willingness to pressure others. The research
question is whether a small group can create a cascade of compliance
among vulnerable disbelievers. The model is located in various kinds of
initial networks and shows some of the conditions for the spread or
nonspread of such norms.

With Deffuant, Huet, and Amblard, we move into more empirical work.
Deffuant et al. are concerned with the diffusion of innovations under
conditions similar to those investigated by Chang and Harrington and
Centola et al., while Powell et al. examine the networks found among
biotechnology firms. Fowler and Smirnov study the leanings of political
parties and voter turnout, modeling the reciprocal effects of voters on
parties and parties on voters. Moss and Edmonds apply ABM techniques
to account for clustered volatility in water usage.

Deffuant et al.’s paper combines the intellectual agendas of the Chang
and Harrington and the Centola et al. papers; it examines diffusion of
innovations in a network that may contain extremists. The paper lies
within the classic diffusion-of-innovations tradition and clearly shows the
immediate applicability of agent-based thinking. As in most simulation
designs, several parameters are varied to generate quasi-experimental tests
of the conditions for successful innovation: the presence of extremists, the
density of the network, the diversity of initial social views of the
innovation, and the degree of uncertainty among people of moderate views.
The more complex nature of the adoption process modeled here enables
Deffuant et al. to pin down more closely the conditions under which given
ideas spread through a population.

Powell et al. are also concerned with innovation and the spread of
ideas, but more concretely in relation to the development of biotechnology.
They observe that biotechnology is typified by extensive collaboration,
and here they map the networks as those structures grew over an
eleven-year period. By applying analyses of the structure and dynamics of the
networks, creating visualizations that graphically show the changing
structure, and building statistical models to describe the growth, they
develop a clear understanding of how the linkages evolve and how this
is related to the changing involvement of institutions: universities, research
institutes, venture capital, and large and small firms.

Fowler and Smirnov use ABM to model voter turnout. The model takes
advantage of the way in which ABMs can be used to model dynamics,
not only by letting the simulated voters interact before choosing whether
or not to vote, but also by letting the simulated parties choose the platforms
they offer depending on both the political context afforded by the other
parties and the feedback they get from the electorate. Both citizens and
parties are modeled as boundedly rational and acting on the basis of limited
information.

Moody, McFarland, and Bender-deMoll’s work forms a backdrop to
studies such as those by Powell et al. and Fowler and Smirnov. They
consider methods for the visualization of evolving social networks and
describe two approaches: flipbooks, in which the network nodes remain
static but the links change over time, and movies, in which both the nodes
and the links (edges) change. The article is illustrated with several
examples, with the movies available in the online version of this issue of
AJS. An appendix describes the software written by the authors that was
used to generate the movies and that is available for free download. As
Moody et al. observe, visualization techniques are well suited to
understanding the kinds of complex data, varying in time and along several
other dimensions, that are examined by Powell et al. in relation to
biotechnology networks, by Moss and Edmonds, and by others studying social
processes in networks. There is, however, still much to learn about the best
techniques and about how to move from the exploration of data sets to a
more confirmatory approach.

Moss and Edmonds suggest that many aggregate time series of
sociological interest show characteristics, such as leptokurtosis and
unpredictable clusters of volatility, that make them difficult to examine using
standard statistical procedures. They argue that a way forward with such
data is to construct ABMs on plausible microfoundations that show
similar “statistical signatures.” These models can then be validated at both
the individual (micro) level, by comparing the behavior of the agents with
observations of individuals, and at the aggregate (macro) level, by
comparing observations of the model with aggregate survey or administrative
data. They illustrate the method with an example of a model that examines
the influences on the demand for domestic water.
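
As an illustration of what such a “statistical signature” involves, the
sketch below computes excess kurtosis (positive values indicate a leptokurtic
series) and generates a toy series whose volatility arrives in clusters. The
regime-switching generator and its parameters are assumptions chosen for the
example; they are not taken from the Moss and Edmonds model.

```python
import random


def excess_kurtosis(xs):
    """Fourth standardized moment minus 3 (zero for a normal distribution)."""
    n = len(xs)
    mean = sum(xs) / n
    m2 = sum((x - mean) ** 2 for x in xs) / n
    m4 = sum((x - mean) ** 4 for x in xs) / n
    return m4 / (m2 * m2) - 3.0


def clustered_series(n, calm=0.5, turbulent=3.0, p_switch=0.05, seed=0):
    """Gaussian shocks whose spread switches between two persistent regimes."""
    rng = random.Random(seed)
    sigma = calm
    out = []
    for _ in range(n):
        if rng.random() < p_switch:  # occasionally change regime
            sigma = turbulent if sigma == calm else calm
        out.append(rng.gauss(0.0, sigma))
    return out
```

Comparing excess_kurtosis for simulated output and for the observed
aggregate series is one simple macro-level check of the kind described above;
it does not by itself validate the micro-level assumptions.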

These papers provide a good indication of the range of work now going
on using social science computation, from theorizing by using “artificial
societies” as a conceptual tool and as a means for checking the implications
of theoretical speculations, to using ABM to understand specific empirical
social phenomena. The papers in this issue are exemplary in that they
use computational methods as a means to an end, the advancement of
sociology, rather than stopping short when they have developed a model
that “works.” They are also exemplary in that, while approaching the
social world from a number of perspectives and focusing on a range of
domains, they make ideas of process, change, or evolution central to their
ways of doing sociology. We hope that this special issue will inspire further
work taking forward these ideas.


Computational Models of Social Forms:
Advancing Generative Process Theory¹


Lars-Erik Cederman
Swiss Federal Institute of Technology, Zurich (ETH) 


Building on Simmel’s theoretical foundations, sociological process
theorists continue to challenge mainstream social theory. So far,
however, they have rarely relied on formal modeling. The author argues
that recent advances in computational modeling offer tools to
explore the emergence of social forms in the Simmelian tradition.
Thanks to common foundations in both epistemology and ontology,
these two fields can benefit from drawing more explicitly on each
other. The process-theoretic tradition in social theory and
contemporary agent-based models shift theorizing from nomothetic to
generative explanations of social forms, and from variable-based to
configurative ontologies. In order to formalize sociational theory, the
author focuses on how to model dynamic social networks and
emergent actor configurations.


Thanks to theoretical advances in the natural sciences and the decreased
cost of computer technology, computational modeling is becoming an
increasingly popular tool in the social sciences. Due to its relative novelty
and somewhat marginal position in most disciplines, however, research
of this kind has primarily focused on methodological challenges posed by
applications to social phenomena. By contrast, the method’s theoretical
foundations are still relatively poorly understood and many theoretical
possibilities remain unexplored by computational scholars. At the same
time, social theorists, following in the footsteps of Georg Simmel’s
pioneering contributions a century ago, have developed a process-based
research tradition that anticipates the scientific practices of today’s
computer-based research. In short, if the sociological process theorists have
been computational modelers avant la lettre, the latter can be seen as
process theorists “après la lettre.”

In view of this apparent conceptual convergence, it would seem useful
to bring the two intellectual traditions together. Agreeing with recent
efforts to bridge the two fields (e.g., Hanneman, Collins, and Mordt 1995;
Sawyer 1998, 2003; Muller, Malsch, and Schulz-Schaeffer 1998; Sallach
2000; Macy and Willer 2002), I argue that both stand to benefit from such
cross-fertilization. Scientists relying on computational techniques would
gain a better understanding of what they are actually doing by anchoring
their research more firmly in social theory, both in terms of epistemology
and ontology. This research strategy would enable them to respond more
effectively to criticism that assumes alternative philosophical positions.
Even more important, such a widening of the ontological spectrum would
help them recognize opportunities that a focus on a more narrow,
method-driven agenda may obscure.

Conversely, I assert that such an intellectual bridge-building exercise
could assist process theorists working in the Simmelian mold as well.
Customarily relying on rather loose metaphors and analogies, these
theorists have struggled to grasp the consequences of their uncompromising
assumptions. Although some of them have devised simple models, their
analysis has generally remained qualitative and intuitive. Due to their
compatibility with the basic principles of the process perspective,
computer-based techniques provide unique conceptual resources to capture
the evolution of complex social forms in time and space.

In this article, I limit the scope to macrotheoretic problems. It is in this
domain that formal modeling is the most needed, but also where its
application causes the most severe difficulties due to the overwhelming
complexity of social systems (Hanneman et al. 1995; Cederman 1997).
Computer-based research promises to overcome some of the conceptual hurdles
that have provoked a powerful trend away from systemic thinking to
microlevel theorizing, especially in the rational-choice tradition (Collins
1999).

I proceed by highlighting the main epistemological and ontological
principles characterizing sociological process theory before turning to the
corresponding dimensions in contemporary computational research. Then, a
section illustrating different types of social forms in agent-based modeling
follows. A concluding section assesses what each literature has to offer
and what theoretical gaps remain to be filled.


SOCIOLOGICAL PROCESS THEORY


Reacting to positivist currents in the late 19th century, Georg Simmel
pioneered a distinctive tradition of process-driven theorizing in sociology.
Rather than assuming that social reality consists of fixed and given entities,
whether individual or collective, Simmel conceived of it as an ongoing
interactive process giving rise to social forms in a spatiotemporal
continuum. “The large systems and the super-individual organizations that
customarily come to mind when we think of society, are nothing but
immediate interactions that occur among men constantly every minute, but
that have become crystallized as permanent fields, as autonomous
phenomena” (Simmel quoted in Wolff 1950, p. 10).

Stressing the emergent quality of such social forms, Simmel refers to
them under the heading of sociation (“Vergesellschaftung”; see Wolff 1950,
p. lxi). Viewed in sociational terms, society is thus not a “‘substance,’
nothing concrete, but an event” (Simmel quoted in Wolff 1950, p. 11,
original emphasis). Languages, social structures, norms, and conventions
are created through “societal production, according to which all these
phenomena emerge in interactions among men” (p. 13). According to
Simmel, the primary goal of sociology is to analyze social forms. “Sociologists
are directed to identify and classify different forms of social interaction;
to analyze their subtypes; to study the conditions under which they
emerge, develop, flourish, and dissolve; and to investigate their structural
properties” (Levine 1971, pp. xxvii–xxviii).

This “process worldview,” which parallels the strictures of Whitehead’s
philosophy (Fararo 1989), has also given rise to a rich tradition of
scholarship that explicitly, or sometimes implicitly, follows Simmel’s theoretical
lead. In the United States, primarily thanks to the efforts of Albion Small,
Robert Park, and George Herbert Mead, the Chicago school became an
influential bastion of sociological process theory. Its main insight tells us
that “every social fact is situated, surrounded by other contextual facts,
and brought into being by a process relating it to past contexts” (Abbott
1997, p. 1152).

Despite the influence of Simmel’s work and that of his American
successors during the first decades of the 20th century, mainstream sociology
went off in a very different direction, largely under the influence of
Parsons’s relatively static theorizing (Levine 1991) and the trend favoring
decontextualized quantitative research methods (Abbott 1997). However,
the process perspective continued to thrive in other scholarly contexts.
Outside the mainstream, for example, Norbert Elias (1978, 1982)
developed his own somewhat idiosyncratic macrohistorical process theory
based on the notion of “figurations.” In anthropology, Fredrik Barth (1981)
stands out as one of the greatest process theorists. Like Simmel’s original
research agenda, Barth’s theoretical approach accounts for social forms
by showing how they are generated by social processes.

More recently, social theorists have attempted to revive and synthesize
these and related streams of thinking by promoting “relationism”
(Emirbayer 1997), “structuration” (Giddens 1979), a “morphogenetic”
perspective (Archer 1995), or the continued relevance of the Chicago school
(Abbott 1997). It goes without saying that this is a somewhat sprawling
literature with considerable internal tensions.² Nevertheless, it is possible
to discern a common metatheoretical pattern of inquiry: all these process
approaches advance generative explanations of social forms
conceptualized as configurations. To articulate these principles more clearly, let us
now consider the epistemological notion of generative theory (How do we
explain?) before turning to the ontological issue of social forms (What do
we explain?).


FROM NOMOTHETIC TO GENERATIVE EPISTEMOLOGY 


Generative theory differs radically from the positivist focus on explanation
in terms of laws and regularities. Most of today’s social science subscribes
at least loosely to a Humean regularity notion of causation, requiring a
constant conjunction of factors, typically across a large number of cases.
Such a conception of what it means to explain has been reinforced by
the wider use of statistics across the disciplines. For example, the political
scientists King, Keohane, and Verba (1994) argue that such a logic of
social inquiry applies regardless of whether the analysis proceeds
qualitatively or quantitatively. These, and many other authors (e.g., Goldthorpe
1997), emphasize the search for, and explanation in terms of, causal
regularities through comparative analysis or statistical control as the primary
goal of social science. Within the philosophy of science, Hempel’s (1965)
idea of covering laws provides the clearest and most sophisticated
expression of this nomological epistemology.

How, then, can explanation be achieved, if not by reference to
regularities? The sociological process approach starts with an observed social
phenomenon, whether unique or ubiquitous, and then postulates a process
constituted by the operation of mechanisms that together generate the
phenomenon in question. Ultimately, explanatory value resides in the


² For example, Archer (1995) rejects Giddens’s “inseparability” claim linking
individuals with social structure. Likewise, there exist differences as to how
radical these authors’ rendering of process ontology is (see Sawyer 2002b). It
should also be noted that symbolic interactionism in the Meadian tradition has
come to focus mostly on microsociological problems, thus partly deviating from
the macrofocus of this article.
specification of (often unobservable) mechanisms and the reconstruction
of a process within which they are embedded.³

The process theorists’ preference for generative explanations stems from
the difficulties of finding and isolating regularities in complex social
systems. Although Simmel ([1890] 1989) was especially pessimistic on this
point, he did not exclude the possibility of identifying ideal-typical forms
and mechanisms under a diverse set of conditions. Ultimately, however,
the question of whether there are macrolevel regularities is not essential
to the generative approach, because even in those cases where they can
be said to exist, process theorists would regard them as insufficient and
superficial substitutes for the deeper understanding yielded by a
generative explanation. Simmel believed that it was impossible to account for
social forms without describing how they are generated through a “genetic
method.” Instead of resorting to divine intervention or individual will,
such entities should be explained “by the notion of societal production,
according to which all these phenomena emerge in interaction among
men, or sometimes, indeed, are such interactions” (Wolff 1950, p. 13,
original emphasis).

Using various labels, such as “abductive” or “retroductive” inference or
“inference to the best explanation,” philosophers since Peirce distinguish
this pattern of inquiry from both induction and deduction (Lipton 1991).
Intimately linked to scientific realism, this notion of causal reasoning has
gradually gained terrain at the expense of Hempel’s nomothetic ideal.
Examples abound in the natural sciences, especially in historically oriented
fields such as geology and evolutionary biology (McMullin 1964). As we
have seen, sociology is no exception to this trend. Although Simmel
did not explicitly use these terms, his method has been characterized as
“abductive” (Krahnke 1999).⁴ The same goes for much of social theory,
including Giddens’s structuration theory (e.g., Wendt 1987, 1999).

The construction of generative explanations based on abductive
inference is an inherently theoretical endeavor (McMullin 1964). Instead of
subsuming observations under laws, the main explanatory goal is to make
a puzzling phenomenon less puzzling, something that inevitably requires
the introduction of new knowledge through theoretical innovation.
Simmel’s response to this challenge was to rely heavily on metaphors and


³ In its generative logic, the process-theoretic approach to mechanisms is similar to
that of some rationalistic analysts who emphasize mechanism-based explanations (e.g.,
Hedstrom and Swedberg 1993). Note, however, that truly relationist process sociology
rejects the reification of actors assumed by rational-choice theory, as well as its tendency
to regard mechanisms as the “poor cousin” of nomological explanations (see Abbott
1996).


⁴ Some scholars argue that Simmel did not engage in causal theorizing at all (see Sylvan
and Glassner 1985).
analogies, thus shedding new and unexpected light on his subject of study
(Simmel 1989; see also Krahnke 1999, p. 94). Likewise, Elias (1978) resorts
to metaphoric constructs, such as dance and the use of personal pronouns,
to clarify the meaning of his social figurations.

Though they regard metaphors and analogies as integrated parts of
scientific practice, realist philosophers of science also stress the pivotal
role of models in the construction of generative theories. Like metaphors
and analogies, models mediate knowledge from a known “source domain”
to a “target domain,” but they do so in a more consistent and precise
fashion, especially in complex settings (Harré 1970). The main function
of models, then, derives from their capacity to trace and reconstruct causal
processes in a coherent way.

It thus comes as no surprise that many efforts to spell out a
process-theoretic vision involve explicit model building. Fredrik Barth
offers an especially clear justification of such a research program. In his
view, generative models


are not designed to be homologous with observed social regularities; instead
they are designed so that they, by specified operations, can generate such
regularities or forms. They should be constituted of a limited number of
clearly abstracted parts, the magnitude or constellation of which can be
varied, so that one model can be made to produce a number of different
forms. Thus by a series of logical operations, forms can be generated; these
forms may be compared to empirical forms of social systems, and where
there is correspondence in formal features between the two, the empirical
form may then be characterized as a particular constellation of the variables
in the model. (Barth 1981, p. 32)


Note that Barth distinguishes generative explanations from the mere
discovery of emergent social forms. Clearly, it is not sufficient to identify
the associations. Rather, what is needed is a deeper explanatory
reconstruction of how the social forms in question were generated. “Explanation
is not achieved by a description of the patterns of regularity, no matter
how meticulous and adequate, nor by replacing this description by other
abstractions congruent with it, but by exhibiting what makes the pattern,
i.e., certain processes. To study social forms, it is certainly necessary but
hardly sufficient to be able to describe them. To give an explanation of
social forms, it is sufficient to describe the processes that generate the
form” (Barth 1981, pp. 35-36).

Reasoning along similar lines, Fararo (1989) elaborates a general
conceptual framework for generative theory. In agreement with other modern
process theorists, Fararo subscribes to realist philosophy of science. To
explain an empirical macrophenomenon, he argues, a social scientist
should “construct a generating process” that produces it (p. 43). Processes
are constituted by recursively iterated operations of generative
mechanisms and rules, very much like a computer program. At the same time,
Fararo’s realist approach to explanation raises the difficult question of
how to model process:


In the context of present-day empirical research, a great deal of stress is
placed on the specification of variables and “explaining” variation in one
variable by appeal to variation in a battery of other variables. This
conception of explanation is based on a legitimate aspect of the dynamic or
process-oriented frame of reference of general theoretical sociology. Namely,
as the generative mechanism produces changes of state, it does so
under parametric conditions. Variations in parametric conditions, either
dynamically or comparatively, thereby produce variations in states via the
generative mechanism. What is omitted in the usual accounts of “causal
models” based on “explaining variation” is the explicit formal representation
of process. (Fararo 1989, p. 42)


This representational challenge makes it impossible to further postpone
a discussion of ontology. As argued in the next section, sociological process
theory differs from mainstream approaches in this respect as well.


FROM VARIABLE-BASED TO CONFIGURATIVE ONTOLOGY 


True to its scientific realist principles, sociological process theory requires
explanations to specify theoretical entities, relations, and mechanisms that
together generate the social forms to be explained. Often these components
have to be postulated because they are unobserved, or in some cases, even
unobservable (Miller 1987). By contrast, positivist research tends to
express causation in terms of variables that indicate properties of social
entities, relations, and mechanisms without specifying these components
directly. To the extent that social forms and their components are
postulated at all, this is done primarily for instrumental reasons, without
attributing actual existence to the theoretical building blocks. After all,
prediction of actual outcomes is often more important than explanation
in these nonrealist perspectives (see Friedman 1953).

The difference between variable-based ontologies and process-theoretic
alternatives can be traced back to Simmel’s general distinction between
“form” and “content.” In his formal sociology, Simmel stressed the former
at the expense of the latter (Levine 1971).⁵ By virtue of its ability to
capture form, Simmelian sociology thus differs from conventional approaches
that “do not describe social processes, but merely correlations between
elements. Simmel studies the concrete sociations between persons and
groups, not indicators of their outcomes (number killed, riot frequencies,
etc.), and explains social structures as the synthesis of these
interactions” (Sylvan and Glassner 1985, p. 45; see also p. 92).


5 Simmel’s separation of form from content has been criticized by those who
think that these two dimensions of social reality are intricately
intertwined (see, e.g., Durkheim 1960, p. 357). This criticism resembles
objections to Barth’s formal anthropology or to Chomsky’s attempt to
develop a syntactic theory of linguistics without reference to semantics.

Despite the importance of social form in Simmel’s theory, however, it is
hard to find a clear definition of this key concept in his work (Wolff
1950, p. xxxix). Levine defines social forms as “the synthesizing
principles which select elements from the raw stuff of experience and shape
them into determinate units” (Levine 1971, p. xv), but that definition
seems too broad for the purposes of this discussion. On the other hand,
Barth’s (1981, p. 32) equation of social forms with “series of regularities
in a large body of individual items of behaviour” is too narrowly
behaviorist. For the present purposes, then, it seems reasonable to define
social forms as configurations of social interactions and actors that
together constitute the structures in which they are embedded.⁶
Furthermore, configurative theories, as opposed to variable-based ones,
contain an explicit representation of configurations, whether exogenous or
endogenous.

From the assumptions of generative theory, it follows that all social
forms are dynamically produced through social processes. Or, to put it
even more starkly: social forms are by definition social processes. The
extent to which actors and structures vary over time depends on the
particular case, however. Some processes generate configurations of
behavior or attitudes without any change in their component entities or
their interaction structures. Other processes are more complex in that they
feature change in actors’ boundaries and their structural arrangement.
Configurative theories that endogenize such phenomena can be characterized
as sociational (Vergesellschaftungstheorien), to use Simmel’s terminology.
The longer the time span, and the more profound the societal
transformation, the more likely it is that sociational processes are
operating. Considering primarily macroprocesses in the Simmelian tradition,
I will focus on such complex configurative transformations of actors and
structures.

It is in these cases that the weaknesses of the variable-based paradigm
become most evident, for, as has already been stated, variables merely
measure dimensions of social forms; they cannot represent the forms
themselves except in very simple cases. Such variable-based analysis
“detaches elements (substances with variable attributes) from their
spatiotemporal contexts, analyzing them apart from their relations with
other elements within fields of mutual determination and flux” (Emirbayer
1997, p. 288). By contrast, social forms always possess a duration in time
and an extension in physical or some abstract, conceptual space. Thus they
can be said to have their own “bodies” or “corporate identities” in a
literal or figurative sense. The latter case applies to collections of
individuals, such as social groups or organizations (Wendt 1999).


6 See also Elias’s (1978) notion of figuration.

Theories that “hardwire” their actors and structures into their ontologies
can be characterized as “essentialist” or “substantialist.” In these cases,
“analysis is to begin with these self-subsistent entities, which come
‘preformed,’ and only then consider the dynamic flows in which they
subsequently involve themselves” (Emirbayer 1997, p. 282). In contrast,
sociational theories, which Emirbayer labels “relationist,” “reject the
notion that one can posit discrete, pregiven units such as the individual
or society as ultimate starting points of sociological analysis” (p. 287).
Such a perspective “sees relations between terms or units as preeminently
dynamic in nature, as unfolding, ongoing processes rather than as static
ties among inert substances” (p. 289).

The overwhelming majority of all social science theories falls into the
variable-based category. In a lucid article on explanations of
macropolitical processes such as democratization and state formation,
Charles Tilly provides examples from historical sociology showing how
almost all analysts in that field “(1) assume a coherent, durable,
self-propelling social unit; (2) attribute a general condition or process
to that unit; (3) invoke or invent an invariant model of that condition or
process; (4) explain the behavior of the unit on the basis of its
conformity to that invariant model” (Tilly 1995, p. 1595).

Yet this practice makes very little sense, because “coherent, durable,
self-propelling social units—monads—occupy a great deal of political
theory but none of political reality” (Tilly 1995, p. 1596).

Likewise, suggesting that mainstream sociology postulates “that the
social world consists of fixed entities (the units of analysis) that have
attributes (the variables),” Andrew Abbott (1988) criticizes the
substantialist mainstream position for making strong methodological
assumptions about social reality that unduly limit or distort the
underlying ontology. Most seriously, the conventional, variable-based
paradigm “ignores entity change through birth, death, amalgamation, and
division” (pp. 171-72). These are all themes that have recurred in the
process-theoretic literature since Simmel ([1908] 1992) asked the
fundamental question about the intersubjectivity, duration, and spatial
extension of social forms (Cederman and Daase 2003).
AGENT-BASED MODELING


Sociological process theory offers suggestive insights into the emergence
of social forms. Yet its potential is partly unrealized, because, as we
have seen in the previous section, it remains a somewhat underspecified
research program in terms of modeling. Most process theorists have had to
content themselves with employing metaphors and analogies, although some
have developed simple models (e.g., Elias 1978; Barth 1981). In this
section, I therefore turn to formal modeling as a way to improve conceptual
precision and theoretical consistency. Given the complexity of
macrosociological research problems, I focus here on computational modeling
rather than on rational-choice models and, within the computational
category, more specifically on agent-based modeling.

Agent-based modeling is a computational methodology that allows scientists
to create, analyze, and experiment with artificial worlds populated by
agents that interact in nontrivial ways and that constitute their own
environment (e.g., Axelrod 1997; Casti 1997; Epstein and Axtell 1996;
Epstein 1999; Macy and Willer 2002). In these “complex adaptive systems,”
computation is used to simulate agents’ cognitive processes and behavior in
order to explore emergent macrophenomena, for example, structural patterns
that are not reducible to, or even understandable in terms of, properties
of the microlevel agents (Holland 1995; Cederman 1997, chap. 3).

Since Herbert Simon’s (1981) brilliant essay on the “architecture of
complexity” and Friedrich von Hayek’s (1967) pioneering writings on
social complexity, researchers in the social sciences have increasingly
come to the conclusion that complex systems call for different theories and
methods from those used to study simple ones. Growing out of the tradition
of general systems theory, complexity theory typically puts more weight on
understanding the operation of microlevel mechanisms than its
systems-theoretic predecessors did (Rosser 1999, p. 184).

In recent years, complexity theory has acquired a new momentum across the
sciences thanks to developments in modern statistical physics together with
interdisciplinary research, much of which has been conducted at the Santa
Fe Institute in New Mexico (Waldrop 1992). This kind of theorizing should
not be confused with related concepts such as catastrophe theory or chaos
theory (Rosser 1999). While sharing the assumption of nonlinear
interactions with much of today’s complexity theory, both these notions are
usually applied to situations of low dimensionality. More central to the
development of complexity theory are the nonequilibrium approaches
developed by European physicists, especially that of the controversial
Nobel laureate Ilya Prigogine and his associates, who specialized in the
physics of dissipative systems. Aided by computational techniques, the
current generation of complexity research has started to make inroads into
the social sciences (e.g., Axelrod and Cohen 1999; Rihani 2002; Urry 2003).

Agent-based models of complex adaptive systems typically feature local
and dispersed interaction rather than centralized control (Resnick 1994).
Moreover, as opposed to conventional rational-choice models that assume
either a small number of dissimilar or numerous identical actors,
agent-based models normally include large numbers of heterogeneous agents
(Epstein and Axtell 1996). Rather than studying equilibrium behavior, the
focus is often on dynamics and transient trajectories far from equilibrium.
Finally, instead of assuming the environment to be fixed, many agent-based
models let the agents constitute their own endogenous environment.

Agent-based approaches should also be contrasted to earlier uses of
simulation in the social sciences, including the tradition of global
modeling (see, e.g., Gilbert and Troitzsch 1999; Taber and Timpone 1996,
pp. 48-49).⁷ Such macromodels, which were especially popular in the 1970s
following the pessimistic report Limits to Growth, typically attempt to
predict population and economic trends many years, sometimes even decades,
into the future (e.g., Forrester 1971; Meadows, Randers, and Meadows 1972).
By the early 1990s, the method had fallen into disrepute, mostly in
response to overly ambitious predictive claims. Still, it continues to
inspire theory building among some sociologists (e.g., Hanneman, Collins,
and Mordt 1995; see also Sawyer 2003).

To further distinguish agent-based modeling from the dynamic systems
tradition, and to underline the congruence between this methodology and
sociological process theory, let us now return to the two metatheoretical
themes of the previous section. Doing so will make clear that agent-based
tools enable the social theorist to pursue generative and configurative
theorizing without abandoning the support of formal models.


FROM NOMOTHETIC TO GENERATIVE MODELS 


Whereas most traditional uses of simulation modeling center on prediction,
agent-based modelers usually rely on generative explanation. This
epistemological orientation has been evident since Schelling’s (1978)
classical study Micromotives and Macrobehavior. Using an example of
aggregate behavior, such as people choosing seats in an auditorium,
Schelling (1978, p. 13) invites the reader “to try to figure out what
intentions, or modes of behavior, of separate individuals could lead to the
pattern we observed.” Obviously, it may be quite easy to figure out what
the social pattern is going to be, but often the aggregate behavior
surprises us. Schelling points out that the latter typically applies when
there are strategic connections among the individuals. Such settings do not
lend themselves to easy summation of the individual motives, which implies
that


we usually have to look at the system of interaction between individuals
and other individuals or between individuals and the collectivity. And
sometimes the results are surprising. Sometimes they are not easily
guessed. Sometimes the analysis is difficult. Sometimes it is inconclusive.
But even inconclusive analysis can warn against jumping to conclusions
about individual intentions from observations of aggregates, or jumping to
conclusions about the behavior of aggregates from what one knows or can
guess about individual intentions. (Schelling 1978, p. 14)


7 There are also other types of computational approaches to social science,
such as rule-based models and natural language processing derived from
artificial intelligence, but these fall outside the purview of this review
(see Bainbridge et al. 1994; Taber and Timpone 1996).


It is evident that this pattern of explanation conforms closely to the
abductive logic discussed above. Adopting a realist as opposed to an
instrumentalist position, Schelling (1978) is primarily interested in
uncovering the causal mechanisms that generate surprising macrolevel
patterns (p. 18). This explanatory strategy is most clearly exemplified in
his famous segregation model, which is one of the world’s first agent-based
models. Puzzled by the stark segregation patterns that appear, for example,
in American cities, Schelling postulates a simple and quite tolerant
microlevel rule that produces two perfectly cultural clusters: it is
sufficient to assume that the residents of each group are unwilling to live
in neighborhoods almost totally dominated by the other group.
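
The generative logic of such a segregation model can be illustrated with a minimal sketch. The code below is not Schelling's own specification: the grid size, the vacancy rate, the eight-cell neighborhood, the tolerance threshold of 0.3, and the rule that unhappy agents relocate to a randomly chosen vacant cell are all assumptions made for illustration, as are the function names. The point is only that a tolerant microlevel rule can be "grown" into a macrolevel pattern and that the pattern can then be measured as an output.

```python
import random


def share_alike(grid, r, c):
    """Share of occupied neighboring cells holding the same group as (r, c)."""
    n = len(grid)
    same = total = 0
    for dr in (-1, 0, 1):
        for dc in (-1, 0, 1):
            rr, cc = r + dr, c + dc
            if (dr or dc) and 0 <= rr < n and 0 <= cc < n:
                if grid[rr][cc] is not None:
                    total += 1
                    same += grid[rr][cc] == grid[r][c]
    # An agent without any neighbors has nothing to object to.
    return same / total if total else 1.0


def mean_share_alike(grid):
    """Macrolevel observable: average like-neighbor share over all agents."""
    n = len(grid)
    vals = [share_alike(grid, r, c)
            for r in range(n) for c in range(n) if grid[r][c] is not None]
    return sum(vals) / len(vals)


def run_schelling(size=20, vacancy=0.1, threshold=0.3, max_sweeps=100, seed=0):
    """Hypothetical parameterization; returns (grid, before, after)."""
    rng = random.Random(seed)
    cells = [(r, c) for r in range(size) for c in range(size)]
    n_vacant = round(size * size * vacancy)
    n_agents = size * size - n_vacant
    labels = (["A"] * (n_agents // 2)
              + ["B"] * (n_agents - n_agents // 2)
              + [None] * n_vacant)
    rng.shuffle(labels)
    grid = [labels[r * size:(r + 1) * size] for r in range(size)]
    before = mean_share_alike(grid)
    for _ in range(max_sweeps):
        unhappy = [(r, c) for r, c in cells
                   if grid[r][c] is not None
                   and share_alike(grid, r, c) < threshold]
        if not unhappy:
            break  # every agent tolerates its neighborhood
        rng.shuffle(unhappy)
        for r, c in unhappy:
            if share_alike(grid, r, c) >= threshold:
                continue  # earlier moves already satisfied this agent
            vacant = [(rr, cc) for rr, cc in cells if grid[rr][cc] is None]
            rr, cc = rng.choice(vacant)
            grid[rr][cc], grid[r][c] = grid[r][c], None
    return grid, before, mean_share_alike(grid)


grid, before, after = run_schelling()
print(f"mean like-neighbor share: {before:.2f} -> {after:.2f}")
```

Note that segregation is never written into the agents' rule; it appears only in the observable computed after the run, which is what makes the explanation generative rather than predictive.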

In another influential statement, Axelrod (1997) articulates similar
principles for agent-based research. Distinct from both induction and
deduction, the agent-based method “is a third way of doing science” (p. 3).
Rather than creating accurate predictions, “agent-based modeling is a way
of doing thought experiments. Although the assumptions may be simple, the
consequences may not be at all obvious” (p. 4; see also Gilbert 2000).
Ultimately, Axelrod’s primary aim is to advance a better theoretical
understanding by uncovering the mechanisms that generate “emergent
properties” (see also Holland 1995, p. 156). Like Schelling, Axelrod
therefore stresses simplicity rather than realism.

Explicitly using the label “generative social science,” Epstein (1999)
further articulates the link to the theme of generative explanation in
sociological process theory. In his view, agent-based modelers working in
this mold have to answer a general question: “How could the decentralized
local interactions of heterogeneous autonomous agents generate the given
regularity?” (p. 41). The preliminary answer to the question is to generate
a process that generates the phenomenon under scrutiny. It should be
stressed that this approach is necessarily tentative, because abductive
inference can only produce “candidate explanations.” Or, to put it more
tersely: “If you can’t grow it, you haven’t explained it.” This formula
obviously begs the question of how we are to select among the candidate
explanations. One way to do this is to test the microlevel implications, as
suggested by Epstein (p. 43).

Here the scientific-realist foundations of the computational paradigm
become particularly evident, for in their search for causal mechanisms,
agent-based modelers such as Axelrod, Schelling, and Epstein reject
instrumentalist reasoning.⁸ In summary, Gilbert and Chattoe put it well:
“The realist epistemology is also a natural progenitor of computational
simulations. It is a short step from developing theoretical models of
mechanisms to developing computational models of mechanisms. It is also the
case that simulations provide an appealing way of both formalising and
exploring realist theoretical models” (2001, p. 114).

Frequently, computational researchers introduce generative explanations
under the label of “emergence,” although the latter is really a subset of
the former because not all generative explanations exhibit emergence. This
term is notoriously tricky to define in light of its varying uses in both
the computational (Holland 1998) and the sociological and philosophical
literatures (Sawyer 2002a), but it could be said that emergence usually
refers to the fundamental irreducibility of complex systems to their
constitutive parts. The failure of reductionism in such settings often
relates to the nonlinearity of the systemic interconnections, something
that implies that the whole is literally more (or less) than the sum of the
parts (Simon 1981; Smith and Stevens 1996).

Simple forms of emergence in agent-based models rely on a straightforward
ascending logic featuring self-organized patterns that are compatible with
methodological individualism (Macy and Willer 2002), which is why Epstein
and Axtell (1996) refer to agent-based modeling as a “bottom-up” approach.
Schelling’s model of residential segregation exemplifies such an ascending
notion of emergence. But, as illustrated by Durkheim’s nonreductionist
social theory, there is also a deeper sense of emergence that relies on
“downward causation” (Sawyer 2002a). In this case, emergent social
phenomena acquire a causal impact and cannot even in principle be reduced
to laws and mechanisms operating at the microlevel. In a careful conceptual
analysis, Crutchfield (1994) calls this type of emergence “intrinsic”
because it presupposes that novelty be detected and acted upon by the
systems’ agents: “The observer in this view is a subprocess of the entire
system. In particular, it is one that has the requisite information
processing capability with which to take advantage of the emergent
patterns” (pp. 3-4).


8 Obviously, the agent-based method is not the only type of formal modeling
that is compatible with a generative epistemology. Although rational-choice
models are usually associated with a nomothetic approach to explanation
(e.g., Green and Shapiro 1994), several theorists stress how rationalistic
modeling can be used as a way to explain phenomena in terms of their
generating mechanisms (Elster 1989; Scharpf 1997). Furthermore, it has
already been noted that variable-based macrosimulations can also be
employed dynamically with the goal of formalizing theories.

Given that most existing agent-based models assume that agents are
equipped with very simple rules and possess extremely primitive cognitive
capacity (cf. Axelrod 1997; Schelling 1978), it is not surprising that
Sawyer (1998, 2002a, 2003) argues that such frameworks fail to embrace
intrinsic emergence. Limiting their schemes to self-organization,
computational modelers conceive of emergent properties as epiphenomenal
configurations that emerge from dynamic webs of interactions, but that do
not feed back down to the microlevel through the agents’ internal models.
Yet as Sawyer (2003) admits, the macrolevel may affect the microlevel
through ecological effects in agent-based models, and to that extent, they
are less reductionist than rational-choice models that restrict the
systemic view. Nevertheless, by limiting top-down emergence to such an
external logic, current computational modeling still does not fully capture
the sociological process tradition, which subscribes to a nonindividualist
ontology featuring downward causation in both the intrinsic and external
sense. I will return to this important issue in the context of viewing
collective actors as social forms below.


FROM VARIABLE-BASED TO AGENT-BASED MODELING


By now, it should be clear that sociological process theory distinguishes
itself from conventional “variable-based” theory building by featuring
explicit and endogenous representations of social forms. Again, the
classical process perspective in sociology anticipates more recent
developments in computational methodology. In fact, agent-based modeling
has often been contrasted to the variable-based approach to simulation. In
the latter form, computer-based modeling usually amounts to the
construction of dynamic models linking variables together in systems of
discretized differential equations (i.e., difference equations), as
exemplified by global modeling. In ecology and population biology, this
contrast pits “individual-based” against “equation-based” modeling (Parunak
et al. 1998; see also McCauley et al. 1993; Fahse et al. 1998).

According to Parunak et al. (1998), there is a fundamental difference
between these two types of approaches. Whereas equation-based modeling
attempts to express causal relations among variables, individual-based
modeling represents interactions among the agents directly. This does not
mean that the individual-based paradigm is incapable of tracing the
evolution of macrolevel variables. In fact, “the modeler pays close
attention to the observables as the model runs, and may value a
parsimonious account of the relations among those observables, but such an
account is the result of the modeling and simulation activity, not its
starting point” (p. 10). The opposite does not hold, however, because
equation-based modeling makes it difficult to represent mechanisms
operating at the agent level, especially if they do not add up in a linear
fashion. In systems featuring heterogeneous and spatially distributed agent
populations, agent-based modeling has an edge over equation-based
techniques thanks to its superior tractability and increased opportunities
for microlevel validation and calibration.
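
The contrast can be made concrete with a small sketch. Both functions below describe the spread of some trait through a population, but everything about them is an illustrative assumption rather than a model from the literature cited here: the logistic difference equation, the ring-shaped neighborhood, the uniformly drawn susceptibilities, and all names and parameter values. In the first function the macrolevel share is the state variable itself; in the second it is merely an observable computed from heterogeneous, locally interacting agents.

```python
import random


def equation_based(x0=0.1, beta=0.5, steps=20):
    """Variable-based view: one difference equation for the adoption share.

    x[t+1] = x[t] + beta * x[t] * (1 - x[t])
    """
    xs = [x0]
    for _ in range(steps):
        x = xs[-1]
        xs.append(x + beta * x * (1.0 - x))
    return xs


def agent_based(n=200, seeds=20, steps=20, seed=1):
    """Agent-based view: the share is an output, not the starting point.

    Agents sit on a ring, see only their two neighbors, and differ in
    their individual susceptibility (all hypothetical assumptions).
    """
    rng = random.Random(seed)
    susceptibility = [rng.uniform(0.1, 0.9) for _ in range(n)]
    adopted = [False] * n
    for i in rng.sample(range(n), seeds):
        adopted[i] = True
    shares = [sum(adopted) / n]
    for _ in range(steps):
        nxt = adopted[:]  # synchronous update
        for i in range(n):
            if adopted[i]:
                continue
            exposed = adopted[(i - 1) % n] or adopted[(i + 1) % n]
            if exposed and rng.random() < susceptibility[i]:
                nxt[i] = True
        adopted = nxt
        shares.append(sum(adopted) / n)
    return shares


eq = equation_based()
ab = agent_based()
print("equation-based share after 20 steps:", round(eq[-1], 3))
print("agent-based share after 20 steps:   ", round(ab[-1], 3))
```

Changing who interacts with whom, or giving each agent a different rule, requires only local edits in the agent-based version, whereas the difference equation would have to be rederived.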

Modern software technology has paved the road for agent-based model
architectures. In particular, object-oriented programming (OOP) facilitates
the representation of agents as autonomous and discrete entities in
simulation models (Hill 1996, chap. 2). In contrast to traditional
procedural approaches, which separate data from algorithms, the principle
of encapsulation introduces objects as self-contained bundles holding both
data and algorithms. Objects communicate with each other through message
passing that respects their boundaries. Furthermore, as opposed to
information held in conventional programming variables, each such entity
has its own inherent identity, implying that “two objects are distinct even
if all their attribute values are identical” (Rumbaugh et al. 1991, p. 2;
see also Khoshafian and Copeland 1986). Thus, an object has its own dynamic
existence in that it can be created and destroyed. This is an aspect of
identity that is sometimes referred to as persistence (Booch 1991). It is
hardly a coincidence that the world’s first OOP language, Simula, was
developed to conceptualize the semantics of simulation models in a natural
way (Hill 1996, p. 25).
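
These three properties (encapsulation, identity, and a bounded existence in time) can be shown in a few lines. The class below is a hypothetical illustration, not code from any of the cited works; the attribute names and the single "transfer" message are invented for the example.

```python
class Agent:
    """A self-contained bundle of data and behavior (hypothetical example)."""

    def __init__(self, wealth, culture):
        self._wealth = wealth    # internal state, private by convention
        self.culture = culture   # state deliberately exposed to others

    def receive(self, message, amount=0):
        """Message passing: others may ask, but the agent updates itself."""
        if message == "transfer" and amount > 0:
            self._wealth += amount
            return True
        return False  # unrecognized requests are ignored

    def same_attributes(self, other):
        """Compare attribute values, which is not the same as identity."""
        return (self._wealth, self.culture) == (other._wealth, other.culture)


a = Agent(wealth=10, culture="x")
b = Agent(wealth=10, culture="x")

# Identity: identical attribute values, yet two distinct objects.
print(a.same_attributes(b), a is b)

# Existence in time: agents can be born into and removed from a population.
population = [a, b]
population.append(Agent(wealth=5, culture="y"))  # birth
population.remove(a)                             # death

# Encapsulation: b changes only in response to a message it accepts.
accepted = b.receive("transfer", amount=3)
ignored = b.receive("confiscate", amount=3)
```

Because `Agent` does not define its own equality test, Python falls back on identity, so removing `a` from the population leaves the attribute-identical `b` in place.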

More recently, multiagent systems (MAS) have added the notion of an agent
to the repertoire of computer science. As the term is used in distributed
artificial intelligence (DAI), agents are more autonomous than objects by
virtue of their ability to refuse to perform messages and to effect
flexible behavior without central program control (i.e., multithreading;
see Wooldridge 1999; Ferber 1999, p. 57). Depending on the complexity of
their internal models, MAS feature either reactive or cognitive agents. So
far, agent-based applications in the social sciences have tended to be of
the former type (Sawyer 2003).

Although OOP and MAS may appear to be merely technical details, these
advances reflect an important semantic shift away from procedural software
engineering to more decentralized paradigms, a move that facilitates the
parallel trend pointing from equation-based to agent-based modeling. After
all, social agents are self-propelled, autonomous entities with distinct,
time-dependent properties. In complex systems, it is especially useful that
encapsulation forces the analyst to declare explicitly what information
each agent has access to, and which of its own properties are visible to
other agents. Identity is important because each agent can be represented
as an independent and unique object instance of a more general template,
usually referred to as a class in OOP languages. As opposed to conventional
variables, which in principle have infinite life spans, agents implemented
as dynamically allocated objects possess an explicit existence in time and
can therefore both be born and die. Beyond object orientation, MAS
techniques emanating from DAI are likely to inspire conceptual
breakthroughs in theory building and further strengthen the trend toward
agent-based perspectives in social science modeling.


MODELING THE EMERGENCE OF SOCIAL FORMS 


The previous discussion has shown that computational modelers tend to
follow the research practices of sociological process theorists in that
they employ generative and configurative explanations. Less has been said
about the types of social forms that figure in existing computational
research designs. In this section, I therefore provide a taxonomy of such
configurations together with illustrations of existing research. The main
criterion for my classification relates to what aspects of a social form
are actually treated as endogenous features. Whereas some models are
limited to the generation of behavioral patterns, others feature a much
more profound “rewiring” of social reality, including the context and
constitution of the main actors. Based on these principles, four levels of
endogenization can be identified in ascending order of ontological depth.
According to this scheme, social forms can be modeled as:


1. Emergent interaction patterns constituting structures of behavioral
choices of microlevel agents, usually in space.

2. Emergent property configurations constituting arrangements of
agents’ microlevel properties, such as their culture or attitudes.

3. Emergent networks constituting dynamic configurations of relations
that modify agents’ ability or inclination to interact with other actors.

4. Emergent actors constituting both individual and collective agency,
including actors’ outer boundaries and their internal organization.


Rather than being mutually exclusive, these types of social forms can
be combined, and they often appear recursively. For example, models that
generate property configurations frequently endogenize behavioral
interaction configurations as well. By the same token, models featuring
dynamic boundaries often require endogenization of the relevant interaction
networks.

As shown by Macy and Willer’s (2002) comprehensive review of recent
sociological applications, most existing agent-based models treat social
forms as interaction patterns or property configurations, while keeping
the interaction topology and the actors’ corporate identities fixed. These
first two categories correspond to what Macy and Willer refer to as models
of “emergent order” and “emergent structure,” respectively. Studies
explaining behavioral aspects of social systems remain the most active
research area in agent-based modeling. Following the path-breaking work by
Axelrod (1984), the literature on behavioral interaction patterns has
centered on explaining the emergence of cooperation in anarchic settings.
As is well known, Axelrod’s main result indicates that cooperative outcomes
can result from egoistic adaptation even in the absence of central
enforcement. Subsequently, this work has spawned an extensive computational
literature on the dynamic conditions of cooperation (for reviews, see
Axelrod and Dion 1988; Macy 1998; Hoffmann 2000; Axelrod 2000; Macy and
Willer 2002).

It should be noted that these cooperation-theoretic models primarily
serve other purposes than generating social forms. Instead, the outcome
dimension is typically limited to a one-dimensional statistic measuring
the level of cooperation in the system (see Macy and Willer 2002). To the
extent that social forms do emerge, however, they are usually seen as side
effects of this primary goal. Illustrating this point, Axelrod (1984,
chap. 8) devoted an often-overlooked chapter of his celebrated book to the
“structure of cooperation.” Noting that cooperative strategies stand a
better chance of invading noncooperative populations if they appear in
clusters, Axelrod studied the geographic distribution of collaborating
agents (see also Cohen, Riolo, and Axelrod 2001). Although Axelrod’s book
primarily focuses on growing cooperation in general, the spatial
configuration of cooperative strategies exemplifies social forms seen as
emergent interaction patterns.⁹

9 If one generalizes the spatial logic from geographic locations to
strategic space, Lomborg’s (1996) study of strategic configurations
involving strategy mixes serving as “nuclei” and “shields” could also be
classified as one that generates behavioral interaction configurations. See
also Cederman (2001) for a model that shows that cooperative configurations
can emerge among democratic states.


The generation of social forms appears to be a more central goal in
studies that explore explicitly emergent property configurations, such as
cultural traits and attitudinal dispositions. Schelling’s (1978) classic
model of segregation remains the best-known example. As we have already
seen, this model generates a spatial configuration of actors possessing
dichotomous “ethnic” traits. By providing numerous references to similar
models, Macy and Willer (2002) demonstrate that this is still a very lively
field of research, especially in sociology. Many models omit Schelling’s
focus on movement in favor of permanently located social actors that
influence each other locally (e.g., Carley 1991; Mark 1998). As is the case
with emergent interaction models, researchers typically focus on
clustering. Axelrod’s (1997, chap. 7) well-known culture model, for
example, generates distinct spatial clusters of agents with identical
traits despite the presence of a homogenizing microlevel mechanism. More
generally speaking, the main sociological insight that can be gained from
these and similar studies is that macrolevel heterogeneity can result from
homophilic microlevel rules (Macy and Willer 2002, p. 13).
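
How a homogenizing rule can coexist with persistent diversity is easier to see in code. The sketch below is only loosely patterned on a culture model of this kind and should not be read as a replication: the grid size, the number of features and traits, the four-cell neighborhood, the number of interaction events, and all function names are assumptions chosen for illustration. Fixed agents interact with a neighbor with a probability equal to their cultural similarity and, when they do, copy one trait on which they differ.

```python
import random


def run_culture_model(size=10, features=5, traits=10, events=20000, seed=0):
    """Fixed agents on a grid; interaction can only make neighbors more alike."""
    rng = random.Random(seed)
    grid = [[[rng.randrange(traits) for _ in range(features)]
             for _ in range(size)] for _ in range(size)]
    steps = [(-1, 0), (1, 0), (0, -1), (0, 1)]
    for _ in range(events):
        r, c = rng.randrange(size), rng.randrange(size)
        dr, dc = rng.choice(steps)
        rr, cc = r + dr, c + dc
        if not (0 <= rr < size and 0 <= cc < size):
            continue  # the chosen neighbor lies off the grid
        me, other = grid[r][c], grid[rr][cc]
        differing = [f for f in range(features) if me[f] != other[f]]
        similarity = 1.0 - len(differing) / features
        # Homophily: the more alike two neighbors are, the likelier they
        # interact; agents sharing nothing never influence each other.
        if differing and rng.random() < similarity:
            f = rng.choice(differing)
            me[f] = other[f]
    return grid


def count_cultures(grid):
    """Macrolevel observable: number of distinct trait combinations."""
    return len({tuple(cell) for row in grid for cell in row})


final = run_culture_model()
print("distinct cultures remaining:", count_cultures(final))
```

The mechanism that sustains heterogeneity is visible in the interaction test: once two neighboring agents share no feature, their similarity is zero and the boundary between them can no longer erode through their direct contact.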

Although these two categories of social forms dominate the computational
literature, it would be a mistake to believe that they exhaust it. Based on
the process-theoretic critique of “substantialism,” it is clear that
configurative explanations often require a more flexible ontology than that
evidenced by the first two categories of emergent social forms. In this
sense, one could fault Macy and Willer’s (2002) otherwise excellent review
for adopting too shallow a notion of “social structure.” On the other hand,
the limited scope of their review is understandable given the scarcity of
attempts to generate deeper social forms. Fortunately, however,
computational modeling in no way precludes endogenization of such
sociational configurations, although the costs in terms of complexity may
discourage some researchers from venturing into this domain. Indeed, the
Simmelian categories of emergent networks and actors remain comparatively
understudied by computational researchers, but as we will see, they are far
from empty.


EMERGENT NETWORKS 


Whereas networks figure in the two previous types of studies, research that
treats networks as emergent social forms requires that interaction
topologies be not only explicitly represented, but also endogenized. The
distinction hinges on the endogenization of interaction opportunities, or
what game theorists refer to as the “game form.” There is a fundamental
difference between studying dynamics on networks and dynamics of networks
(Watts 2003). In this section, I focus exclusively on the latter. Because
their heritage goes back to cellular automata, traditional agent-based
models typically rely on grids and thus satisfy the requirement of explicit
representation (Troitzsch 1997), but they seldom live up to the requirement
of endogenized interaction configurations.10 In fact, square lattices can be
seen as a special case of networks, which is a much more general class of
social forms and thus more suitable for model building in the
process-theoretic tradition.

In mathematical sociology, there is a very rich literature on social
networks that dates back several decades (see, e.g., Wellman 1983). Still,
partly because of the specific analytical tools employed, much of this
scholarly activity has focused almost entirely on static characteristics of
network structures (Watts 2003, p. 50). To the extent that dynamics are
explored at all, tractability constraints force analysts working with models
to assume stationarity and homogeneity (Zeggelink 1994; see also Sawyer
2003). Relying on his own object-oriented code, Zeggelink (1994) was the
first network theorist to apply agent-based modeling to the analysis of
dynamic friendship networks (see also Conte et al. 1998; Skvoretz and Fararo
1996). Today, standard computational packages, such as Repast and Swarm,
offer built-in software for this purpose.

While these developments bring modeling into closer harmony with process
theory, the main focus in the sociological literature remains on the
dynamics of small, interpersonal networks. In contrast, Simmel’s formal
sociology stresses large-scale forms as much as smaller ones. Fortunately,
in the last few years, statistical physicists have brought their analytical
skills to bear on large social networks (Albert and Barabási 2002). Even
though the first steps in this direction concerned dynamics on networks, as
evidenced by Watts and Strogatz’s (1998) pioneering work on the small world
problem, physicists have more recently turned their attention to dynamic
configurations. Such settings may converge to an equilibrium, but they may
also feature nonequilibrium networks, to which vertices and edges are added
or from which they are removed (Dorogovtsev and Mendes 2002, p. 1086).

The large scale of the physicists’ models enables the researchers to explore
emergent macroeffects that cannot be detected in smaller systems. This is
crucial, because complex networks exhibit general patterns and regularities
that allow us to talk of “universality classes.” Under such conditions,
macrolevel effects occur irrespective of the particular microlevel
mechanisms involved. “This is a tremendously hopeful message for anyone
interested in understanding the emergent behavior of complex social and
economic systems like friendship networks, firms, financial markets, and
even societies” (Watts 2003, p. 65).


10 In an indirect sense, it could be argued that Schelling’s (1978)
segregation model features these types of entities, for in his framework,
the actors’ mobility implies that their interaction topology evolves over
time. Likewise, some behaviorist interaction models feature exit and other
aspects of voluntary partner selection that implicitly produce network
structures (e.g., Majeski 1999; for references to other examples, see Macy
1998; Hoffman 2000; Axelrod 2000).

It is not hard to see the parallel to Simmel’s aforementioned distinction
between form and content. Anticipating the principles of modern statistical
physics, Simmel believed that social forms vary more or less independently
of the contents of social interactions (Levine 1991, p. 1104). This
assumption makes it possible to study the formal “geometry” of social
interactions while ignoring many of their microlevel details. In this sense,
computational network models are ideally suited to operationalize Simmel’s
formal sociology.

In fact, the computational tools allow for a much more precise connection of
the micro- and macrolevels than expected by Simmel. Complexity theory
suggests that many social forms are associated with distinct statistical
footprints in terms of their degree distributions, transitivity, or other
formal properties. Such patterns can be used as the starting point for
generative theorizing that serves to uncover the mechanisms that are
responsible for their emergence. For example, researchers have been puzzled
by the highly skewed distribution of connectedness among Web sites on the
Internet. To put it more exactly, the degree distribution follows a power
law (for a popular introduction, see Buchanan [2002]). To account for this
asymmetry, Albert and Barabási (2002, p. 71) ask, “What is the mechanism
responsible for the emergence of scale-free networks?” It turns out that a
very simple model can generate power-law-distributed configurations.
Barabási and Albert (1999) created such a network by continuously adding
nodes and connecting the new nodes to the previous ones such that the new
links exhibit “preferential attachment.” This term means that new links are
added in proportion to each node’s popularity as measured by its current
number of links.
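
The growth rule just described can be illustrated with a minimal sketch. It
is not the code used by Barabási and Albert. The function names, the small
fully connected starting core, and the parameter values (2,000 nodes, two
links per new node) are assumptions made here for illustration only.

```python
import random
from collections import Counter


def grow_preferential_attachment(n_nodes, links_per_node=2, seed=0):
    """Grow an undirected network by preferential attachment.

    n_nodes and links_per_node are illustrative parameters, not values
    taken from the article.
    """
    rng = random.Random(seed)
    # Assumed starting point: a small fully connected core, so that every
    # initial node already has at least one link.
    core = links_per_node + 1
    edges = [(i, j) for i in range(core) for j in range(i + 1, core)]
    # Each node appears in `stubs` once per link it holds, so a uniform
    # draw from `stubs` picks a node with probability proportional to
    # its current number of links.
    stubs = [node for edge in edges for node in edge]
    for new in range(core, n_nodes):
        targets = set()
        while len(targets) < links_per_node:
            targets.add(rng.choice(stubs))
        for old in targets:
            edges.append((new, old))
            stubs.extend((new, old))
    return edges


def degree_counts(edges):
    """Return a Counter mapping each node to its number of links."""
    degrees = Counter()
    for a, b in edges:
        degrees[a] += 1
        degrees[b] += 1
    return degrees


edges = grow_preferential_attachment(2000)
degrees = degree_counts(edges)
print("largest degrees:", sorted(degrees.values(), reverse=True)[:5])
print("smallest degree:", min(degrees.values()))
```

In runs of this sketch a handful of early nodes typically accumulate far more
links than the rest, which is the skew that the power-law description refers
to. Fitting the exponent of the degree distribution is a separate step that
the sketch does not attempt.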

Obviously, not all social networks exhibit scale-free connectivity. This
property appears to follow from topologies that impose little or no cost on
the addition of new nodes. Where there are constraints applied by geography
or other factors, the dynamic process of network formation and evolution may
well generate emergent social forms characterized by other skew
distributions, such as log-normal or exponential distributions (Jin, Girvan,
and Newman 2001). The most recent studies in the physics literature have
attempted to identify what distinguishes social networks from other types of
networks. Newman and Park (2003) postulate that it is the subdivision into
groups and communities that produces the key macrolevel properties of social
networks (see also Watts 2003, pp. 118–29). In particular, network
configurations could emerge thanks to “assortative mixing,” not unlike the
homophilic property configurations discussed above (Newman 2003). A related
postulate attributes scale freedom and clustering to the hierarchical
organization of complex social networks (Ravasz and Barabási 2003).

Despite this impressive progress, considerable distance remains between
statistical physics and Simmelian process theory. This is so partly because
the models fail to endogenize the actors’ identities, even in cases where
they are explicitly represented. Yet in real-world social systems,
individuals coevolve with network structures to such an extent that it is
hard to separate individual “plasticity” from network “elasticity” (Lazer
2001). Few, if any, models combine emergent property configurations with
dynamic networks.11 Moreover, endogenizing identities entails more than
accounting for actors’ properties, cultural or otherwise, because Simmel’s
notion of individuality requires a more profound notion of identification.
It is to this challenge that I turn in the next section.


EMERGENT ACTORS 


The models associated with the three previous categories of social forms
subscribe to individualism in the sense that individual actors are assumed
to exist at the outset of the analysis and to persist until its conclusion.
To be sure, such a substantialist assumption simplifies model construction,
yet it contradicts Simmel’s ontological principles that require not only
structures, but also actors, to be socially constructed. To follow the
principles of sociological process theory, then, it is necessary to generate
actors as emergent social forms. This in turn requires that the actors be
problematized in terms of their “corporate identities” (Cederman and Daase
2003; cf. Wendt 1999).

How could agent-based modeling circumvent the need to postulate a set of
reified actors at the outset of the modeling process? Drawing explicitly on
sociological process theory, Abbott (1995) shows how to avoid the trap of
anthropomorphic extrapolation from biological individuals. The trick is to
focus on potential boundary elements before assuming the presence of the
actors to be generated. Typically, such “sites of difference” are formed in
a “soup” of microlevel actors. “Previously-constituted actors enter
interaction but have no ability to traverse the interaction inviolable. They
ford it with difficulty and in it many disappear. What comes out are new
actors, new entities, new relations among old parts” (Abbott 1995, p. 863).

Abbott’s scenario dovetails with Simmel’s theory of how collective actors
emerge in reaction to external threats. Simmel distinguishes between cases
in which the cohesion of an already existing group increases as it enters
into an antagonistic relationship with another group, and those cases where
there was no preexisting group consciousness before conflictual interaction.
“Conflict may not only heighten the concentration of an existing unit,
radically eliminating all elements which might blur the distinctness of its
boundaries against the enemy, it may also bring persons and groups together
which have otherwise nothing to do with each other” (Simmel [1908] 1955,
pp. 98–99).12


11 Though see also Padgett (1997), who proposes a model in which network
skills emerge dynamically, and Macy et al. (2002), who take a step in this
direction by devising a computational model that features self-organizing
groups in a Hopfield attractor network.

Fortunately, at least some aspects of these ideas can be translated readily
into computational language. In a generic model of ecological morphogenesis
called Echo, John Holland (1995) lets “primitive agents” amalgamate into
“multiagents” through a process of boundary formation where lower-level
agents are able to merge into composite entities, thus assuming the status
of “agent-compartments” (see also Cederman 2002). Such an organizational
transformation requires that explicit rules of action scope and resource
transfer be specified. This is why some type of internal organizational
structure is needed to hold together and coordinate the activities of the
agent-compartments.

Axtell’s firm-size model provides a particularly concise and instructive
example of this approach to the formation of actors. Drawing on work by
physicists, Axtell (1999) notes that the size of real-world companies is
power-law distributed. Treating this phenomenon as an emergent statistic
reflecting a specific social form, his generative approach accounts for the
mechanisms producing it. By postulating straightforward rules of joining and
leaving firms, Axtell is able to generate scale-free aggregate behavior.13

Featuring a similar, though inherently more complicated, logic of
organizational genesis, another line of research focuses on properties of
state systems in world politics. Already in 1977, Bremer and Mihalka (1977)
introduced a model featuring conquest in a hexagonal grid, later extended
and further explored by Cusack and Stoll (1990). Cederman (1997) introduced
a new generation of models in the same tradition. These models share a
common architecture that starts with a territorial grid of fixed and
indivisible primitive agents that can be thought of as villages or counties.
The states that survive grow, and their boundaries expand endogenously
through a repeated process of conquest. The resulting states become
hierarchical organizations linking capitals to their respective provinces
through direct, asymmetric relations of domination. It should be noted that
although the agents reside in a grid-based space, the underlying organizing
principles presuppose that the actors be organized as a dynamic network, for
conquest inevitably changes the list of neighbors. Models of this type can
be used to explore dynamic features of competitive geopolitical systems,
such as the duration of balance of power (Cusack and Stoll 1990) and
war-size distributions (Cederman 2003).


12 Coser (1964) offers an influential, functionalist interpretation of
Simmel’s conflict hypothesis that reduces it to a behavioral phenomenon
applying to relations among exogenous actors. For a critique of this
perspective, see Sylvan and Glassner (1985).

13 Beyond the specific issue of firm sizes, statistical mechanics continue to
inspire generative model building that endogenizes actor boundaries (see,
e.g., Dittrich et al. 2000).

The examples covered so far include only two-level organizations. However,
computational organization theorists have gone beyond this limitation by
analyzing multilevel networks explicitly. Whereas most of the literature
investigates the properties of fixed social forms, some studies set out to
grow organizational structures (Carley 2002). In a prominent example, Carley
and Svoboda (1996) apply simulated annealing as a way to represent
organizational adaptation in terms of restructuring and learning. Recent
computational organization theory fits especially well into the generative
research program because it increasingly attempts to solve “backward” rather
than “forward” problems (Lomi and Larsen 2001).

Do actor configurations generated in these illustrative models exhibit
emergence? The answer to this question hinges on which class of emergence is
aspired to. If the goal is to generate emergent patterns in the “bottom-up”
sense, the question can be answered in the affirmative, at least when it
comes to specific instances of the social forms. For example, in Bremer and
Mihalka’s (1977) model, it is impossible to predict what the states’
boundaries are going to look like based on inspection of the microlevel
rules. As regards entirely new organizational forms, however, it is less
obvious that existing research generates even individualist emergence.
Axelrod (1997, chap. 6) proposes a “tribute” model of new political actors
that may be at least a partial exception to this observation. According to
Axelrod’s algorithm, collective actors emerge if there is a pattern of
interactions that confirms a number of properties seen to be constitutive of
agency. These include effective control over subordinates, collective
action, and recognition by third parties that an actor has been formed.

Nevertheless, intrinsic emergence calls for explicit representations of
organizational forms inside the actors’ “internal models,” that is, their
cognitive maps of themselves and their environments (Holland 1995). Because
of their relative simplicity, the agents in all of the aforementioned cases
fail to live up to this standard. Whereas there are models that exhibit
downward causation, in these cases, the causal pattern is not emergent in
itself. To date, the computational literature lacks emergent models that
“are both micro-to-macro and macro-to-micro modeled simultaneously” (Sawyer
2003, p. 347).

Thus, it has to be concluded that a considerable gap remains between the
state of the art of agent-based modeling and sociological process theory. To
close this gap, model building must confront the challenge of developing
agents with internal models that recognize emergent effects while at the
same time predicating their actions on them. All this has to be done without
resorting to reification of the emergent patterns themselves. As explained
by Sawyer (2002a, p. 554), philosophers refer to such phenomena as
“supervenient” in that these “higher-level entities and properties [are]
grounded in and determined by the more basic properties of physical matter”
without being reducible to the latter. We are thus quite far from realizing
this ideal in contemporary agent-based frameworks, but richer cognitive
models employing agent communication languages and some developments in
Artificial Life should give us hope that such a project is indeed
feasible.14


CONCLUSION 


What can be concluded from this attempt to connect sociological process
theory with agent-based modeling? First of all, my comparison has
illustrated that the two research traditions exhibit striking similarities.
At the epistemological level, both shun nomothetic notions of causation in
favor of a generative scheme of explanation. In terms of ontology, both
bodies of theory put processes of social forms at the center of the research
agenda rather than skipping directly to analysis in terms of causal
relations among variables.

What can process theorists learn from agent-based modeling? First and
foremost, this survey should convince them that formal modeling does not
necessarily entail abandoning generative and configurative explanations.
Indeed, the agent-based paradigm offers a viable alternative that is
fundamentally compatible with process-theoretic foundations, serving as an
important corrective not only to contemporary rational-choice theory and
statistical methods, but also to variable-based reinterpretations of Simmel,
such as Coser (1964) and Levine (1991). Second, given the massive complexity
of macrohistorical processes, there is an acute need for an “accounting
mechanism” that assists the theorist in performing thought experiments and
counterfactual scenarios involving interacting subprocesses (Cederman 1997,
chap. 3). The formalism also helps theory builders sharpen abstract concepts
and articulate otherwise hard-to-grasp mechanisms. Finally, thanks to their
formal precision, computational models sometimes generate characteristic
statistical footprints that can be used as generative targets in the search
for candidate mechanisms.

What can computational modelers learn from the process tradition in
sociology? Although the sociological literature often appears obtuse, this
article has highlighted the value of consulting the classics and keeping
abreast of some of the most recent developments in social theory. By virtue
of their “head start,” sociological theorists have already had ample time to
confront some of the most tricky issues in social theory. Their insights,
especially those relating to generativity and social forms, could counteract
the strong influence of positivist and individualist perspectives on
modelers, particularly in economics. Though often manifested abstractly, the
macrosociological literature on process also contains a wealth of empirical
and historical examples which are bound to inspire computer-assisted
theorizing. Even more crucially, however, comparison with the
process-theoretic literature highlights theoretical challenges that have
only been partially dealt with by computational methodologists. Without such
conceptual guidance, it would be tempting to content oneself with models
that generate social forms as emergent interaction patterns and property
configurations. Instead, a careful reading of sociational theory tells us
that model building could also endogenize networks and actors. While much
work remains to be done until the computational repertoire of models finally
generates intrinsic emergence in accordance with Simmel’s classic
strictures, the agenda has been set for future conceptual challenges.

Ultimately, theoretical innovation advances through analogy-driven transfers
of knowledge and insights from one conceptual domain to another. The
well-established marriage of evolutionary game theory and computational
modeling has already proven extraordinarily fertile. Now it is time to
explore what the new union between sociological process theory and
computational modeling can offer.


Small and Other Worlds: Global Network Structures from Local Processes


Garry Robins, Philippa Pattison, and Jodie Woolcock 
University of Melbourne 


Using simulation, we contrast global network structures, in particular small
world properties, with the local patterning that generates the network. We
show how to simulate Markov graph distributions based on assumptions about
simple local social processes. We examine the resulting global structures
against appropriate Bernoulli graph distributions and provide examples of
stochastic global “worlds,” including small worlds, long path worlds, and
nonclustered worlds with many four-cycles. In light of these results we
suggest a locally specified social process that produces small world
properties. In examining movement from structure to randomness, parameter
scaling produces a phase transition at a “temperature” where regular
structures “melt” into stochastically based counterparts. We provide
examples of “frozen” structures, including “caveman” graphs, bipartite
structures, and cyclic structures.


INTRODUCTION 


It matters that networks differ. To justify this claim, we need look no
further than the compelling description of the rise of the Medici by Padgett
and Ansell (1993). They argue that Cosimo de Medici came to power in
15th-century Florence in large part because the Medici family was at the
center of a starlike structure of marriage and business alliances, a
patterning of interfamilial relations that could be efficiently activated
by, and only by, the Medici. In contrast, the dense interconnections among
Cosimo’s political opponents, the traditional oligarchic families of
Florence, hindered a decisive response. In the crucial showdown, the
oligarchic opposition to Cosimo splintered into confusion and mistrust.
Because of their dense pattern of ties, the oligarchs knew who should take
the field, so absences were treated as defection. The Medici clients knew
too little to feel betrayed. They stayed; the oligarchs fled.


Global Network Structure and Local Network Processes 


Networks differ, but in what sense? It is the global structure of the
Florentine network that counts in the description above. The argument rests
on the patterning of relations among all families. By global structure we
refer to features that can only be determined by examining the entire
network (or at least large parts of it). In the Florentine families network,
the juxtaposition of large patterns (a starlike structure adjacent to a
dense cliquelike pattern) cannot necessarily be surmised by considering
individual nodes or their immediate network neighborhoods, that is, more
local subnetworks.

In this article, we contrast such broad global features with the local
patterning that could give rise to them. Our representation of local
structure relies on small network configurations that we describe below as
subgraphs involving only a few network ties. We construe these
configurations as the outcomes of local social processes. Network ties
emerge, persist, and disappear by virtue of actions made locally at the
scale of the individual actors in a network (whether they be persons or
families or companies or some other social entity). Actors do not usually
cast their gaze across the entire network, possibly because in most cases
they can only “see” what is in their local social neighborhood (Pattison and
Robins 2002). On the basis of their localized view, they form strategies and
make decisions that intersect with those others who are socially proximate.
Combinations of these competing or complementary intentions and actions
constitute social processes that make up local patterns of relationships.
These local patterns agglomerate to create the global structure. The
strength of Padgett and Ansell’s account of the Medici lies in showing how
the localized processes (involving individual marriage and business
partnerships within and across neighborhoods of medieval Florence) led to
the global features that mattered.


The Role of Computation and Simulation 


Because we can construe global structure as an agglomeration of local
patterns, an understanding of the global can be greatly enhanced by
computational and simulation techniques. It is not inherently difficult to
infer the local structural outcome of a simple postulated local social
process (e.g., as discussed below, a tendency to structural balance leads to
localized triadic structures). What is not apparent, however, are the global
outcomes when several of these localized patterns combine, perhaps with
different strengths. Indeed, except in a small number of cases (examples are
discussed below), computational techniques are necessary because analytic
solutions are simply not available.

The advent of computer technology enables us to traverse this conceptual and
methodological gap from individual local patterns to the possibility of
various global phenomena. This is potentially a two-way process. We observe
certain global properties of interest in a network (e.g., small world
properties as discussed below). We can then speculate how these properties
might emerge from localized social processes, develop a model embodying such
processes, and then use simulation procedures to investigate the global
outcomes of this model. If the results mirror the global properties of
interest, we can then check whether some or all of the postulated local
processes are operative in the observed network.

In other words, we do not see observation, modeling, and simulation as
inherently different approaches toward understanding, but rather a
potentially powerful combination of tools that readily complement each
other. In this article, we concentrate on the results of simulating from
some simple but plausible models, but we are mindful of the need to subject
these models to stringent empirical testing. For instance, one step that is
needed to go beyond this discussion is to show that the processes inherent
in the models are actually observed in empirical networks.2


Which Global Properties? 


Despite the possibility that local patternings shape global properties of
the network, it is not always clear which global features are best to
examine, or indeed simply how best to describe global network structure.
Sometimes, as in the Medici example, important global qualities seem
apparent,3 but if we consider the range of networks that Faust and Skvoretz
(2002) attempted to compare, it is not at all obvious what the optimal
criteria are. Even so, the recent flurry of activity on small world
networks, following the seminal work of Duncan Watts (Watts and Strogatz
1998; Watts 1999a, 1999b), illustrates just how much can be gained by
investigating one or two global features that seem of general relevance, in
this case the prevalence of short paths in networks in the presence of some
randomness. Average path length is clearly a global feature of a network.
What local processes could make it “short”?


2 For at least some observed networks, we have evidence that the models
described below are indeed plausible descriptions, not just on the basis of
measures of fit, but also because important global properties can be
reproduced (Robins 2003).

3 At least once Padgett and Ansell (1993) have done all the hard exploratory
data analysis to reveal the important structural features.

Following Watts’s innovative approach, there has been an upsurge of exciting
research into global network phenomena with a focus on properties (described
in more detail below) such as path length, clustering, and degree
distribution (e.g., Amaral et al. 2000; Barrat and Weigt 2000; Barthélémy
and Amaral 1999; Bohland and Minai 2001; Comellas, Ozon, and Peters 2000;
Kirillova 2001; Mathias and Gopal 2001; Newman 2000; Newman and Watts 1999;
Pandit and Amritkar 2001; Pastor-Satorras and Vespignani 2001; Yang 2001).
Much of this research concentrates on large-scale networks, including
growing networks such as the Internet. One advantage of considering large
networks with indeterminate numbers of nodes is that asymptotic results may
sometimes be available analytically, at least for simpler models. Strogatz
(2001), Albert and Barabási (2002), and Newman (2003) provide excellent
reviews of this burgeoning literature.
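
Two of the global properties named above, average path length and
clustering, can be computed directly from a list of each node's neighbors.
The following is a minimal sketch and is not drawn from any of the cited
studies. The function names and the six-node example network are
assumptions chosen for illustration, and clustering is taken here to be the
average, over nodes, of the share of a node's neighbor pairs that are
themselves tied.

```python
from collections import deque


def average_path_length(adj):
    """Mean shortest-path length over all ordered pairs of connected nodes."""
    total, pairs = 0, 0
    for source in adj:
        # Breadth-first search gives shortest distances from `source`.
        dist = {source: 0}
        queue = deque([source])
        while queue:
            u = queue.popleft()
            for v in adj[u]:
                if v not in dist:
                    dist[v] = dist[u] + 1
                    queue.append(v)
        total += sum(dist.values())
        pairs += len(dist) - 1
    return total / pairs if pairs else float("nan")


def clustering_coefficient(adj):
    """Average share of each node's neighbor pairs that are themselves tied."""
    values = []
    for u, neighbors in adj.items():
        nbrs = list(neighbors)
        k = len(nbrs)
        if k < 2:
            values.append(0.0)
            continue
        closed = sum(1 for i in range(k) for j in range(i + 1, k)
                     if nbrs[j] in adj[nbrs[i]])
        values.append(closed / (k * (k - 1) / 2))
    return sum(values) / len(values)


# Hypothetical example: a ring of 6 nodes, each tied to the two nearest
# nodes on either side.
n = 6
ring = {i: {(i + d) % n for d in (-2, -1, 1, 2)} for i in range(n)}
apl = average_path_length(ring)
cc = clustering_coefficient(ring)
print("average path length:", apl)
print("clustering:", cc)
```

Both measures require a pass over the whole network, which is the sense in
which they are global rather than local features.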


Locally Specified Network Models 


Despite 1ts innovation and rigor, the literature on global network phe- 
nomena gives at best passing attention to the local processes that might 
generate most real networks Few of the models can be locally specified 
The original model of Watts (1999a) starts with an exogenous global 
structure (a large circle through all nodes, termed by Watts a cyclic sub- 
strate and also known more technically as a one-dımensıonal lattice) and 
then introduces some local processes that add to that structure (random 
edge addition combined with a tendency toward clustering, or triangle 
formation) In the short time since its introduction, this basic model has 
been used and elaborated by many others, clearly constituting an ımpor- 
tant new approach to understanding global features. But with an exogenous
global structure as its starting point, it is not locally specified, and
so cannot tell us how small worlds might emerge in real networks.⁴

4 Watts has more recently developed new locally specified models to describe the results
from the original small world experiment of Milgram (1967), recognizing that his cyclic
substrate model does not allow for strategic search by actors for effective network
partners (Watts, Dodds, and Newman 2002). Indeed, without allowing for a search
capacity, there is doubt that the original model can account for actors’ use of short
paths, given that they only have local information available to them (Kleinberg 2000).

Another well-known recent model is that of preferential attachment
(Barabási and Albert 1999), where for a growing network a new node is
attached to existing nodes with a probability dependent on the degree
distribution of existing nodes. This model can be shown to result in a
power-law degree distribution with resulting scale-free properties. Here
we have more of the flavor of a locally specified model, but to determine
the probabilities of attachment, the new node has to “know” the existing
degree distribution across the entire graph. This model has considerable
strengths in describing the structure of the Internet, which is indeed an
expanding network. But we disagree with the claim by Albert and Barabási
(2002) that most real-world networks describe open systems that
grow by the continuous addition of new nodes. Such a claim ignores the
large amount of empirical work in a range of literatures (from the
anthropological to the organizational) that searches for an understanding
of small-scale, relatively closed network systems. For instance, the
membership of the Florentine elite was relatively stable, in comparison to the
changes in their interrelationships resulting from ongoing social and
economic processes.
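As a rough illustration of the preferential attachment process just described, the following Python sketch grows a graph one node at a time. It is a minimal sketch under stated assumptions, not the specification used by Barabási and Albert: the function name, the parameter `m` (edges added per new node), and the choice of a small complete graph as the seed are all hypothetical choices made for this example.

```python
import random

def preferential_attachment(n, m, seed=None):
    """Grow a graph to n nodes; each new node links to m distinct existing
    nodes chosen with probability proportional to their current degree.
    Assumption: the process is seeded with a complete graph on m + 1 nodes."""
    rng = random.Random(seed)
    edges = {(i, j) for i in range(m + 1) for j in range(i + 1, m + 1)}
    # 'stubs' lists every node once per incident edge, so a uniform draw from
    # it is a degree-proportional draw. Maintaining it requires the degree of
    # every node in the graph, which is the global knowledge noted in the text.
    stubs = [v for e in edges for v in e]
    for new in range(m + 1, n):
        targets = set()
        while len(targets) < m:
            targets.add(rng.choice(stubs))
        for t in targets:
            edges.add((t, new))
            stubs.extend((t, new))
    return edges
```

The `stubs` list makes the point of the paragraph concrete: each attachment decision consults a structure built from the whole graph, so the rule is not purely local.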

Of course, simple random graph models, arising originally from the
work of Erdős and Rényi (1959; see also Gilbert 1959), are much closer
to local specification. Two simple random graph models have similar
properties. In the first, L edges are added at random to a graph of n nodes,
sometimes referred to among social network analysts as the U|L model
(the uniform distribution of random graphs with a given number of nodes,
conditional on the presence of L edges). The second model posits that an
edge between pairs of nodes occurs independently and with fixed probability
P, sometimes referred to as the Bernoulli graph distribution (Frank
1981; Frank and Nowicki 1993). The local specification of a Bernoulli
graph is clear because the probability of a tie between two nodes is
independent of anything else in the graph. If friendship networks could be
reasonably described as Bernoulli graphs (which they cannot), we could
claim that any pair of humans has a given propensity to friendship,
irrespective of whatever other friendships may occur. A local specification
permits a ready translation into a process or behavior at the level of the
actors, the nodes in the network.
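The two simple random graph models can be sketched in a few lines of Python. This is an illustrative sketch only; the function names are hypothetical, and nodes are assumed to be labeled 0 to n - 1 with each nondirected edge stored as an ordered pair (i, j), i < j.

```python
import random
from itertools import combinations

def sample_u_given_l(n, L, rng):
    """U|L model: choose exactly L of the n(n-1)/2 possible edges uniformly."""
    return set(rng.sample(list(combinations(range(n), 2)), L))

def sample_bernoulli(n, p, rng):
    """Bernoulli graph: each possible edge is present independently
    with the same fixed probability p."""
    return {pair for pair in combinations(range(n), 2) if rng.random() < p}
```

In the Bernoulli sampler, the decision for each pair consults nothing but that pair, which is the sense in which the model is locally specified.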

Local specifications generally assume a level of homogeneity, that there
are some effects observed locally that are reproduced across the entire
network (for instance, a fixed P for a Bernoulli graph). A homogeneity
constraint assumes a systemwide property, but in almost all social networks,
the only intentionality in the system is at the level of the actor,
that is, locally (Robins and Pattison 2001). Similar arguments provide the
theoretical underpinning for Snijders's actor-oriented models (Snijders
1997, 2001; Van De Bunt, Van Duijn, and Snijders 1999). As Robins and
Pattison (2001) argued, the systemic property inherent in a homogeneity
constraint may reflect shared norms or behaviors across actors, norms or
behaviors that could be construed as inherent in the particular social
relation for this group of people. Modeling based on local effects presupposes
some such level of sharedness of local behaviors across the system
(otherwise modeling is impossible). Randomness in such models arises
from nonshared local behaviors.

The extent of homogeneity is usually an unheralded issue. Much social
network modeling assumes at least homogeneity of nodes;⁵ that is, the
identity of nodes is irrelevant to the model, so that the model does not
presuppose that one node is more likely than another to have, for instance,
a given degree. Node homogeneity is also a feature of the Watts (1999a)
cyclic substrate model, although not of the preferential attachment model,
where the nodes are ordered across time.

A purely local specification is not always appropriate. Clearly there
may be exogenous effects that operate at a global level (Robins and Pattison
2001). The CEO of a company may impose an organizational restructuring;
television does shape the transmission of information; a computer
company with a large market share may bundle certain Web site
addresses as preset features into its Internet vehicles. These exogenous
actions are not local processes. Even so, they may influence but not determine
the underlying network effects. The restructuring may not work,
and workers may revert to some of their old alliances; the television news
may be doubted and different information transmitted locally; the computer
user may choose not to utilize some features of the package. So even
when exogenous effects are apparent, some form of local specification may
still be required.

In this article, we use Markov random graph distributions (Frank and
Strauss 1986), described in detail below, as a basis for locally specified
models. We have three principal aims: (a) to establish methods to assess
and compare global structures for distributions of small, possibly nonconnected,
graphs; (b) to use simulated Markov graph distributions to
investigate various types of global structures that may arise, using different
parameter values that can be related to a few very simple postulated
local processes; and (c) to illustrate how the scaling of parameters affects
the transition from randomness to more regular structure. We shall give
examples of what we call small world distributions of graphs with a given
number of nodes, but we shall also show how to simulate distributions
of long path worlds, of relatively dense graphs with no clustering, and of
“degenerate,” highly structured worlds, such as the caveman graph of
Watts (1999a).


5 Except in cases where some form of blockmodeling might be the objective of the
analysis (e.g., Nowicki and Snijders 2001), or where the modeling explicitly includes
observed or latent actor attributes (e.g., Hoff, Raftery, and Handcock 2002; Lazega
and Van Duijn 1997; Robins, Elliott, and Pattison 2001).

Our third aim of parameter scaling can be distinguished from the scale-up
problem, which concerns the change in global outcomes as the number
of nodes in the network grows. Although the scale-up problem has a long
tradition in random graph theory (Bollobás 1985), we concentrate here
on the scaling up of parameter values in networks with a constant number
of nodes.⁶ The parameter scale-up issue has been ignored in the existing
graph literature, but we demonstrate that it is inherently connected with
the transition from randomness to regularity in graph structures (Watts
1999a). We make some comments on the impact of the size of the node
set in our conclusions, as a matter for ongoing work.

Our focus is the nexus between local specifications and global properties.
Small world aspects are just one instance, but as they have featured
so prominently in recent work and have been so helpful in permitting
new ways of thinking about local-global connections, it is useful to introduce
our approach specifically through a small world lens. In what
follows, then, we begin with a discussion of certain aspects of small world
problems. We then introduce our simulation strategy, including a description
of exponential random graph (p*) models, of which Markov random
graphs are a subclass. We give examples of the various global structures
that emerge with different parameter values. We go on to discuss the
effect of increasing parameter values, with examples of the resulting
regularized structures. We conclude with a discussion of further work to
elaborate our general approach.


THE SMALL WORLD 


Stanley Milgram (1967) famously concluded that the median number of
intermediaries required for one person in the United States to contact
another person was five. Despite some important work in the intervening
years (e.g., Kochen 1989; Pool and Kochen 1978), small world ideas rather
languished until given new currency in the 1990s (Watts and Strogatz
1998; Watts 1999a, 1999b). A social network of acquaintances can be
represented as a graph, that is, a structure comprising nodes and edges
that connect pairs of nodes.⁷ Watts (1999a) specified the properties of small
world networks in graph-theoretic terms (defined in detail in the immediate
paragraph below): a small world graph has low density and is highly
clustered but has short characteristic path lengths. Watts (1999b) noted
some additional relevant properties about the human social world: the
number of nodes is very large, and the network is highly decentralized
in that there is no dominant node to which other nodes are directly
connected (or adjacent). As he noted, the juxtaposition of all of these
characteristics seems counterintuitive, yet they are features of many empirical
networks, including social networks (see also Albert and Barabási 2002).

6 As noted earlier, the vast bulk of empirical network analysis focuses on networks
with a fixed number of nodes.

7 In this article, we focus on nondirected networks.


Terminology 


Before discussing Watts’s approach to constructing a probabilistic model
for small world graphs, we first review some graph-theoretic and social
network terminology.

1. A graph, G = (N, E), comprises a set N of nodes and a set $E \subseteq N \times N$
   of edges, each of which connects a pair of distinct nodes in N (we say
   that the edge e = ij connects nodes i and j).

2. The order of a graph is the number n of nodes in N.

3. The size of the graph is the number L of edges in E.

4. The density of the graph is the number of observed edges as a
   proportion of the total possible number of edges, namely, 2L/n(n − 1).

5. The degree of a node is the number of edges incident to it, so that
   if there are L edges in a graph of n nodes, the average degree per
   node k equals 2L/n, and the density equals k/(n − 1).

6. The degree distribution for a graph is the vector $(d_0, d_1, \ldots, d_{n-1})$
   of degree frequencies, where $d_k$ is the number of nodes having degree k.

7. G′ = (N′, E′) is a subgraph of G if $N' \subseteq N$ and $E' \subseteq E$.

8. A q-star is a subgraph of (q + 1) nodes in which one central node is
   connected by an edge to exactly q other nodes.

9. A triangle is a subgraph comprising a set of three nodes (a triad),
   each pair of which is connected by an edge.

10. A path of length m is a sequence $(e_1, e_2, \ldots, e_m)$ of edges such
    that $e_k = i_k j_k$, $i_{k+1} = j_k$, and all nodes visited by the path
    are distinct.

11. A geodesic between two nodes is the shortest path between them,
    taken to have infinite length if there is no path between the two
    nodes.

12. A connected graph has paths between all pairs of nodes (i.e., no
    geodesics of infinite length).

13. The local clustering coefficient $C_i$ of a node i is the proportion of
    pairs of nodes to whom node i is connected that are connected to
    each other.

In general, clustering in a graph refers to a propensity for pairs of nodes
to be connected to each other if they are connected to a common third
node.⁸ The characteristic path length (M) of a graph is a measure of central
tendency (across the graph) of the length of geodesics. Watts (1999a)
defined clustering and characteristic path length in the following terms:

the clustering coefficient C is the average value across nodes of the local
clustering coefficient $C_i$;

the characteristic path length M is the mean geodesic length.
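Several of the definitions above translate directly into code. The following Python sketch is offered only as an illustration; the function names are hypothetical, nodes are assumed to be labeled 0 to n - 1, and the local clustering coefficient of a node with fewer than two neighbors (which the definition leaves undefined) is set to zero by assumption.

```python
from collections import deque
from itertools import combinations

def adjacency(n, edges):
    """Neighbor sets for a nondirected graph on nodes 0..n-1."""
    adj = {i: set() for i in range(n)}
    for i, j in edges:
        adj[i].add(j)
        adj[j].add(i)
    return adj

def density(n, edges):
    """Observed edges as a proportion of possible edges: 2L / n(n - 1)."""
    return 2 * len(edges) / (n * (n - 1))

def degree_distribution(adj):
    """Vector (d_0, ..., d_{n-1}); d[k] is the number of nodes of degree k."""
    d = [0] * len(adj)
    for nbrs in adj.values():
        d[len(nbrs)] += 1
    return d

def local_clustering(adj, i):
    """Proportion of pairs of neighbors of i that are themselves connected."""
    pairs = list(combinations(adj[i], 2))
    if not pairs:
        return 0.0  # assumption: undefined case treated as zero
    return sum(1 for u, v in pairs if v in adj[u]) / len(pairs)

def geodesic_lengths(adj, source):
    """Breadth-first search; nodes missing from the result are unreachable,
    that is, their geodesic from source has infinite length."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    return dist
```

For a triangle on nodes 0, 1, 2 with a pendant node 3 attached to node 2 and an isolated node 4, the density is 0.4, the degree distribution is (1, 1, 2, 1, 0), and node 2 has local clustering 1/3.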


Clearly the small world phenomenon relates to characteristic path
length. Some major results relating to path length and connectedness are
well known in graph theory (Bollobás 1985). For U|L and Bernoulli graph
distributions, the Erdős-Rényi theorem (Erdős and Rényi 1959) establishes
that almost all simple random graphs with more than n ln(n)/2 edges are
connected. Other important results used by Watts (1999a) include that,
for large n and k, the characteristic path length, M, for a simple random
graph is of the order of ln(n)/ln(k), and C is of the order of k/n. So for
low-density graphs, the tendency for clustering in a random graph is small.
Moreover, M for random graphs is typically short. So when a graph is
an agglomeration of independent, randomly added edges, we expect short
characteristic path length but little tendency for clustering. But certain
highly structured graphs, such as Watts’s (1999a) “connected caveman
graph” (a graph of small clustered components connected in one large
cycle), have high levels of clustering with long characteristic path length.
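The order-of-magnitude results quoted above are easy to evaluate for a given n and k. The helper below is a hypothetical convenience function written for this illustration; it returns rough benchmarks, not exact expectations.

```python
import math

def random_graph_benchmarks(n, k):
    """Rough benchmarks for a simple random graph with n nodes and
    average degree k, using the asymptotic orders quoted in the text."""
    return {
        "edges_for_connectedness": n * math.log(n) / 2,  # Erdős-Rényi threshold on L
        "path_length_order": math.log(n) / math.log(k),  # M is of order ln(n)/ln(k)
        "clustering_order": k / n,                       # C is of order k/n
    }

print(random_graph_benchmarks(1000, 10))
```

For n = 1,000 and k = 10, the path length benchmark is about 3 and the clustering benchmark is 0.01, which illustrates the combination of short paths and little clustering expected of simple random graphs.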


Simulating Random Graph Distributions 


Watts's (1999a) argument used connected graphs (i.e., with finite geodesics).
In particular, he focused on graphs with a cyclic substrate, a starting
structure for his simulations constituted by a cycle of edges through all
nodes. A cyclic starting point ensures that the network is connected, so
one can work with more analytically tractable mean path lengths rather
than, say, median path lengths. As discussed below, connectedness may
or may not be a problematic assumption depending on the type of network
under consideration.

8 There is an alternative network usage of the term clustering to refer to the clustering
of ties around certain nodes through the formation of stars, with transitivity referring
to triangulation (e.g., Frank and Strauss 1986). We use the term clustering as relating
to triangulation in accord with the usage adopted by the recent small world literature.

Under the algorithm, edges are added to the graph allowing for a
tendency toward clustering but with a variable degree of randomness (i.e.,
the algorithm permits edges to be added at random, as well as according
to the clustering criterion). By stopping the algorithm at a specified density,
Watts observed some expected results. With no randomness, graphs were
highly clustered, but path lengths tended to be long, as is the case of the
connected caveman graph. With too much randomness, on the other hand,
the random component of the algorithm overwhelmed the clustering aspect,
and low clustering resulted, as is the case for graphs with edges
added randomly. But Watts also found that a small amount of randomness
in the model resulted in a class of graphs with relatively high clustering
and short characteristic path length, that is, small world graphs. Watts
developed an explanation of this shortening of paths through the
establishment of “shortcuts” across the large cycle. Although we do not use a
cyclic substrate, the notion of shortcuts across a cycle is an important
element in what we present below.

Conceptually our approach described below draws heavily on this
work, but the implementation differs in several important respects. We
do not concentrate on connected graphs, or graphs that originate from
various substrates. Rather than using a central tendency of characteristic
path length, we investigate the distribution of geodesics. We simulate
distributions of random graphs that have certain global properties emergent
from local dependencies among edges. To simulate, we use the Metropolis
algorithm with several advantages.⁹ The first is that it permits
us to simulate stochastic models for social networks that are derivable
from first principles using the Hammersley-Clifford theorem (Besag 1974)
and so possess a principled statistical basis. Different models embody
different assumptions about the relevant local social processes that might
generate the network. Accordingly, we are able to interpret any conclusions
specifically in terms of local social processes that may or may not
generate global properties. The second advantage is that, in principle, the
algorithm ensures convergence to a given distribution irrespective of the
starting point of the simulation. As a result, we need not be concerned
about a starting substrate, nor do we need to rely on an arbitrary stopping
rule. And we are able to derive statistics from a large sample of graphs
from a given distribution, giving generality to our conclusions. A third
advantage is that model parameters can also, in principle, be estimated
from observed network data (e.g., Snijders 2002), allowing useful comparisons
between models under consideration and empirical observations.

Before we describe the details of our approach, we discuss briefly some
aspects of clustering and of characteristic path length and connectedness.


9 Several algorithms could have been used; see Snijders (2002) for a review of some
other possibilities.


Clustering


In network-theoretic terms, the idea of clustering is closely related to that
of structural balance (Cartwright and Harary 1956), whereby individuals
who are friends (for instance) are consistent in their friendships (and
nonfriendships) with other people. As Holland and Leinhardt (1970, 1971)
recognized, structural balance essentially involves the notion of transitivity
in graphs, which in nondirected graphs is represented as the presence
of triangles. If there is a triangle on a triad of nodes, the triad is
transitive. It will be apparent that a triangle comprises three two-stars.
If a two-star (which also involves a triad of nodes) is not part of a triangle,
then that triad is intransitive.

Structural balance is the outcome of a local process. We do not need
to inspect the whole graph to determine whether a particular triad of
nodes is balanced or not. The proportion of transitive to intransitive triads
in the graph is an aggregate measure of the tendency toward transitivity
and structural balance, and hence of clustering. Because any triangle
contains three two-stars, in what follows we use the proportion $3T/S_2$ as
the clustering coefficient C, where T is the number of triangles and $S_2$ the
number of two-stars in the graph as a whole (Newman, Strogatz, and
Watts 2001).¹⁰ (Note that C = 1 if all triangles are transitive and the graph
can be partitioned into completely connected components.) In other words,
the clustering coefficient assesses the sharedness of balance-type processes
across the entire network. As will be seen below, the statistics T and $S_2$
are also important in our models for Markov random graphs, so this
proportion is also naturally related to our method of graph simulation.
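The clustering coefficient defined above can be computed by counting triangles and two-stars directly. The following is a minimal Python sketch with a hypothetical function name; it assumes nodes labeled 0 to n - 1, and it returns zero when the graph has no two-stars, a case the definition leaves open.

```python
from itertools import combinations
from math import comb

def clustering_coefficient(n, edges):
    """Global clustering C = 3T / S2 for a nondirected graph."""
    adj = {i: set() for i in range(n)}
    for i, j in edges:
        adj[i].add(j)
        adj[j].add(i)
    # A node of degree d is the center of C(d, 2) two-stars.
    two_stars = sum(comb(len(nbrs), 2) for nbrs in adj.values())
    triangles = sum(
        1 for i, j, k in combinations(range(n), 3)
        if j in adj[i] and k in adj[i] and k in adj[j]
    )
    return 3 * triangles / two_stars if two_stars else 0.0
```

A triangle with one pendant node has T = 1 and five two-stars, giving C = 0.6, while any complete graph gives C = 1.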


Characteristic Path Length and Connectedness 


Watts (1999a) used the mean geodesic across the graph as the measure
of characteristic path length. The problem with utilizing a mean, however,
is that it can only be computed for connected graphs. Can we expect the
social networks we study to be connected? Erdős and Rényi (1959) showed
that almost all random graphs with number of nodes n will be connected
when the average degree is greater than ln(n). On a global scale, a network
of a billion nodes would require average degree of around 20 for the
Erdős-Rényi theorem to apply, assuming of course that the graph was
indeed Bernoulli.
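The figure of around 20 follows from a short calculation, written out here as a worked check using the relation between average degree k and number of edges L from the terminology section (k = 2L/n):

```latex
\ln(10^{9}) = 9 \ln 10 \approx 9 \times 2.3026 \approx 20.7,
\qquad
L > \frac{n \ln n}{2} \approx \frac{10^{9} \times 20.7}{2} \approx 1.04 \times 10^{10}.
```

So the threshold on average degree is a little under 21, which corresponds to roughly ten billion edges in total.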

But ultimately it is unhelpful to do this simple calculation without
taking into account substantive considerations, in particular, the nature

10 This definition differs slightly from that used by Watts (1999a), the average of local
clustering. The two measures are in fact highly correlated.

where

1. the summation is over all subsets A of the set of network variables
   in X;

2. $z_A(x) = \prod_{X_{ij} \in A} x_{ij}$ is the network statistic corresponding to subset A;

3. $\kappa = \sum_x \exp\{\sum_A \lambda_A z_A(x)\}$ is a normalizing quantity; and

4. the parameter $\lambda_A = 0$ for all x unless all the variables in A are
   mutually (conditionally) dependent.
Equation (1) is a general form. For the purposes of this article the
specific version we use is presented below as equation (3), which we will
interpret in fuller detail. But before doing so, there are some features of
the general form (1) that are worth noting. Each of the subsets A of
network variables may be construed as a local social neighborhood (Pattison
and Robins 2002), that is, a site of mutual contingency among network
ties, with such contingencies expressing a social process. For instance,
in the example of clustering above, the subset of possible ties
A = {ij, ik, jk} constitutes a local social neighborhood and the associated
network variables are mutually conditionally dependent. The binary statistic
$z_A(x)$ is computed from x and takes the value one if all the possible
ties in the subset A are present in x. If all the possible ties in A are observed,
there is a triangle (a subgraph configuration) observed in x. If the parameter
$\lambda_A$ is large and positive, the probability of observing the network
x is enhanced if that triangle (the configuration corresponding to A) is
observed. If we take A to be not just a particular triangle, but a generic
triangular effect (i.e., we suppose that the parameter $\lambda_A$ is the same for
all possible triangles in the network¹⁴), then the statistic $z_A(x)$ becomes a
count of triangles in the network, and the parameter $\lambda_A$ represents the
tendency for transitivity (i.e., clustering) in the network.

More generally, the models express the importance of various types of
network configurations (e.g., triangles), with each network configuration
relating to a parameter $\lambda$ and with the associated z statistic being a count
of that configuration in the network. The expression in (1) then represents
a probability distribution of graphs with the properties implied by the
configurations (e.g., models with a triangle parameter express tendencies
to clustering).

dependent on each other). There is one and only one parameter for each subset of
variables that are mutually conditionally dependent.

14 Technically this step imposes homogeneity across isomorphic network configurations
(Wasserman and Pattison 1996). It corresponds to an assumption of sharedness of the
behavior across the network.

The simplest models arise if we assume no dependencies among network
variables at all (i.e., social ties are independent of each other). Allowing
that the probability of any tie is constant across all possible ties, we then
have the class of Bernoulli graphs, and the model is


$$\mathrm{prob}(X = x) = (1/\kappa) \exp\{\theta L(x)\}, \tag{2}$$


where L(x) is the number of ties in the network x and $\theta$ is a parameter
relating to the number of edges (sometimes called the edge or density
parameter). Here the relevant subgraph configuration is a single edge,
and the model is expressed in terms of such configurations, that is, the
number of edges.
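It may help to see how the edge parameter in model (2) relates to the fixed tie probability of the Bernoulli graph. The following short derivation is a standard calculation added for illustration; the symbols N (the number of possible ties) and p (the tie probability) are introduced here and are not notation from the text.

```latex
N = \tfrac{1}{2}n(n-1), \qquad
\kappa = \sum_{x} e^{\theta L(x)} = \sum_{\ell=0}^{N} \binom{N}{\ell} e^{\theta \ell} = (1 + e^{\theta})^{N},
\\[4pt]
\mathrm{prob}(X = x) = \prod_{i<j} \frac{e^{\theta x_{ij}}}{1 + e^{\theta}}
\quad\Longrightarrow\quad
p = \frac{e^{\theta}}{1 + e^{\theta}}, \qquad \theta = \ln\frac{p}{1-p}.
```

The probability factorizes over pairs, so each tie is present independently with probability p, and a negative edge parameter corresponds to a density below one half.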

For social networks, the assumption of independent ties is generally
implausible. Frank and Strauss (1986) introduced Markov dependencies,
whereby two possible edges are assumed to be conditionally dependent
if they share a node. For nondirected networks, the resulting model has
parameters relating to a small number of configurations: stars of various
types and triangles. For the purposes of this article, we shall concentrate
on the four Markov configurations of figure 1: single edges, two-stars,
three-stars, and triangles. The model then becomes


$$\mathrm{prob}(X = x) = (1/\kappa) \exp\{\theta L(x) + \sigma_2 S_2(x) + \sigma_3 S_3(x) + \tau T(x)\}, \tag{3}$$


where $\sigma_k$ and $S_k(x)$ refer to the parameter associated with k-star effects
and the number of k-stars in x, respectively, and $\tau$ and T(x) refer to the
parameter for triangles and the number of triangles, respectively. The
interpretation of the triangle parameter has been discussed above in terms
of clustering and balance; a k-star parameter relates to the propensity for
k-stars to be present in the network, that is, a tendency for individuals
to have connections with multiple network partners.¹⁵ Note that the
Bernoulli graph distribution is a special case of (3) with only the edge
parameter nonzero. For a Markov dependency assumption, the use of
two-stars and triangles has become standard. The rationale for the inclusion
of the three-star parameters in the models is given below.¹⁶

15 It is well known that the parameters in exponential random graph models are not
independent. Such interdependence is to be expected because the models treat two-stars
as two-way interactions among single-edge variables, and triangles as three-way
interactions. Of course, two- and three-way interactions in statistical methods are not
independent of each other, nor from main effects. The complexity of the interactivity,
however, does mean that small changes in parameter values may or may not have
large effects. This is the expected behavior of complex systems, and in part this article
is intended to explore such behavior.

16 Model specification appropriate for observed networks is a vexed question. This is
not an issue that immediately concerns us here, but our current recommendation, based
on the simulation results described below, is that at least three-stars should be included
in Markov graph models fitted to empirical network data.

FIG. 1. Configurations represented by parameters in the model: single edges, two-stars,
three-stars, and triangles.

The model in (3) then represents a distribution of graphs based on edge,
star, and triangle parameters. The parameters relate to local subgraph
configurations. By setting the parameters to particular values and then
simulating this distribution, we can investigate certain global network
properties across the distribution of these graphs.
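To make the statistics in model (3) concrete, the sketch below counts edges, two-stars, three-stars, and triangles for a given graph and evaluates the exponent of (3). The function names are hypothetical, the edge list is assumed to contain each nondirected edge once, and the normalizing quantity is deliberately not computed because it requires a sum over all graphs of the given order.

```python
from math import comb

def markov_statistics(n, edges):
    """Return (L, S2, S3, T) for a nondirected graph on nodes 0..n-1."""
    adj = {i: set() for i in range(n)}
    for i, j in edges:
        adj[i].add(j)
        adj[j].add(i)
    L = len(edges)
    # A node of degree d is the center of C(d, k) k-stars.
    S2 = sum(comb(len(a), 2) for a in adj.values())
    S3 = sum(comb(len(a), 3) for a in adj.values())
    # Each triangle is found once from each of its three edges.
    T = sum(len(adj[i] & adj[j]) for i, j in edges) // 3
    return L, S2, S3, T

def log_weight(stats, theta, sigma2, sigma3, tau):
    """Exponent of model (3): the log-probability up to the constant -log(kappa)."""
    L, S2, S3, T = stats
    return theta * L + sigma2 * S2 + sigma3 * S3 + tau * T
```

Because the normalizing quantity is common to all graphs, differences in `log_weight` between two graphs of the same order give their log-probability ratio under (3).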

With the model in equation (3) as the basis for our simulations, the
next few sections cover a number of technical details before we turn to
our results. In these technical sections, we outline the simulation approach;
we present our framework for comparing graphs and graph distributions
in order to determine important features in global structures; we discuss
the appearance of frozen patterns when parameter values are large; we
provide further detail on model interpretation and the rationale for the
inclusion of three-stars; and we briefly discuss our choices of parameter
values.


The Simulation Strategy 


We simulate graph distributions based on (3) using the Metropolis algorithm
(Gilks, Richardson, and Spiegelhalter 1996). This technique was
first utilized for social networks by Strauss (1986). In summary, the algorithm
operates as follows. We decide on the model we wish to simulate
by fixing the values of the parameters, and we start with a randomly
chosen initial graph with a fixed number of nodes. At each iteration, we
propose a new graph as a candidate for the next step in a Markov chain.
The candidate graph x′ is determined from the current graph x by selecting
at random a possible edge from i to j and changing $x_{ij}$ to $1 - x_{ij}$.
We accept the candidate graph as the current graph whenever the candidate
graph has an increased probability of being observed according to
(3). When the probability is not increased, we accept the candidate graph
with probability equal to the ratio of the probability of x′ to that of x
under (3),


$$r = \exp\{\theta(L(x') - L(x)) + \sigma_2(S_2(x') - S_2(x)) + \sigma_3(S_3(x') - S_3(x)) + \tau(T(x') - T(x))\}. \tag{4}$$


The algorithm establishes a Markov chain on the state space of all graphs
of order n and converges to a distribution of the form of (3). There may
be a “burn-in” period as the chain moves away from the starting graph
toward the distribution.
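The steps above can be sketched in Python as follows. This is an illustrative sketch under stated assumptions rather than the code used for the simulations reported here: the function name and arguments are hypothetical, the starting graph is assumed to include each possible edge with probability 0.5, and only the model statistics (plus the final graph) are returned rather than the sampled graphs themselves. The changes in the statistics are computed locally from the degrees and shared neighbors of the two nodes involved.

```python
import math
import random

def metropolis_markov_graphs(n, theta, sigma2, sigma3, tau,
                             iterations, burn_in, thin, seed=None):
    """Metropolis sampler for model (3); returns (samples, adj), where each
    sample is a tuple (L, S2, S3, T) and adj is the final list of neighbor sets."""
    rng = random.Random(seed)
    adj = [set() for _ in range(n)]
    for i in range(n):                      # assumed random starting graph
        for j in range(i + 1, n):
            if rng.random() < 0.5:
                adj[i].add(j)
                adj[j].add(i)
    L = sum(len(a) for a in adj) // 2
    S2 = sum(math.comb(len(a), 2) for a in adj)
    S3 = sum(math.comb(len(a), 3) for a in adj)
    T = sum(len(adj[i] & adj[j]) for i in range(n) for j in adj[i] if i < j) // 3
    samples = []
    for step in range(1, iterations + 1):
        i, j = rng.sample(range(n), 2)      # a possible edge chosen at random
        present = j in adj[i]
        sign = -1 if present else 1         # toggling removes or adds the edge
        di = len(adj[i]) - present          # degrees not counting edge ij itself
        dj = len(adj[j]) - present
        dL = sign
        dS2 = sign * (di + dj)
        dS3 = sign * (math.comb(di, 2) + math.comb(dj, 2))
        dT = sign * len(adj[i] & adj[j])
        log_r = theta * dL + sigma2 * dS2 + sigma3 * dS3 + tau * dT
        # Accept if the probability does not decrease, else with probability r.
        if log_r >= 0 or rng.random() < math.exp(log_r):
            if present:
                adj[i].discard(j)
                adj[j].discard(i)
            else:
                adj[i].add(j)
                adj[j].add(i)
            L, S2, S3, T = L + dL, S2 + dS2, S3 + dS3, T + dT
        if step > burn_in and (step - burn_in) % thin == 0:
            samples.append((L, S2, S3, T))
    return samples, adj
```

Working with the logarithm of r avoids overflow when parameter values are large, and updating the statistics incrementally means that each iteration costs only a set intersection rather than a recount over the whole graph.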

It should be emphasized that the simulation does not converge to a
particular graph. What is produced is a distribution of graphs. Graphs
in this distribution will be more probable, and so more likely to appear
in our sample, if they express the properties implied by (3) once parameter
values have been set. For instance, if we set $\tau$ as positive, then there is
a tendency for graphs in the distribution to have more triangles than
would be the case if the parameter value were zero or negative. In this
way, we can simulate graphs with greater clustering. Of course, for triangles
to occur at all there have to be some two-stars present, so that
below, we typically set the two-star parameter as positive to ensure this.
As discussed below, this can be interpreted as a tendency for actors to
have multiple network partners. Of course, there needs to be a ceiling on
this tendency, else there is no reason why people should not be partners
with everyone else, and in realistic social networks, having too many
network partners is costly in terms of time and effort. Accordingly it is
useful to set the three-star parameter as negative. We discuss the relevance
of the three-star parameter more fully below. The edge parameter, on the
other hand, simply represents a baseline tendency for edges to be present.

We have simulated graphs of various sizes from 30 to 500 nodes. Most
of the results we present here are based on graphs of 100 nodes, in part
because larger graphs cannot readily be depicted to illustrate various
points. Generally, we seek to simulate distributions of graphs with low
density, principally because high-density graphs have high clustering and
short paths anyway, so that the small world nature of such graphs is
hardly in question. And, once again, it is easier to illustrate certain points
with depictions of low-density graphs, as the clarity of an illustration is
sometimes lost when there are many edges present. We achieve lower-density
graphs by adjusting the edge parameter $\theta$ in (3) as required.¹⁷

In our simulations, we use 500,000 iterations. Burn-in is checked by
observing the stabilization of the statistics across iterations. As burn-in
typically occurs relatively early, we generally remove the first 50,000
iterations to ensure we are sampling from the distribution. We then sample
every 100th graph, giving a typical sample size of 4,500 from each
distribution.
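The sample size quoted above follows directly from the burn-in and thinning settings. As a small check, with a hypothetical helper name:

```python
def retained_steps(iterations, burn_in, thin):
    """Iteration numbers kept after discarding burn-in and thinning."""
    return range(burn_in + thin, iterations + 1, thin)

kept = retained_steps(500_000, 50_000, 100)
print(len(kept))  # 4500
```

That is, (500,000 − 50,000)/100 = 4,500 retained graphs.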

17 We note that despite the usefulness of graphical depictions, we have chosen examples
rather carefully to make the representations clear. Of course, empirical networks may
not result in such clarity when depicted, so in general the analysis is more important
than the graphical representation.

For each sampled graph, apart from the model statistics (number of
edges, two- and three-stars, and triangles), we calculate the following
aggregate measures: degree distribution, geodesic distribution in percentiles
(although for simplicity we shall use quartiles here), and clustering
coefficient. (We also collect, for depiction, a number of graphs as typical
representatives of the distribution.¹⁸) So for our graph distribution, we
have an accompanying distribution of clustering coefficients. The situation
is somewhat more complicated for degree distribution, for each sampled
graph from the graph distribution has its own distribution of node degrees.
For instance, in our sample of graphs the frequency of nodes with degree
one varies across a certain range. Below we use a series of boxplots to
represent this “distribution within a distribution.”
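Quartiles of the geodesic distribution remain well defined for nonconnected graphs provided that unreachable pairs are assigned infinite length. The sketch below illustrates this; the function name is hypothetical, and the use of the nearest-rank rule for quartiles is an assumption made for this example, since the text does not specify a percentile convention.

```python
import math
from collections import deque

def geodesic_quartiles(n, edges):
    """Return (G25, G50, G75) over all pairs of nodes; unreachable pairs
    contribute geodesics of infinite length."""
    adj = {i: set() for i in range(n)}
    for i, j in edges:
        adj[i].add(j)
        adj[j].add(i)
    lengths = []
    for s in range(n):
        dist = {s: 0}                       # breadth-first search from s
        queue = deque([s])
        while queue:
            u = queue.popleft()
            for v in adj[u]:
                if v not in dist:
                    dist[v] = dist[u] + 1
                    queue.append(v)
        lengths.extend(dist.get(t, math.inf) for t in range(s + 1, n))
    lengths.sort()

    def nearest_rank(p):                    # assumed percentile convention
        return lengths[max(0, math.ceil(p * len(lengths)) - 1)]

    return nearest_rank(0.25), nearest_rank(0.50), nearest_rank(0.75)
```

For a path on four nodes plus one isolated node, the quartiles are (1, 2, inf): the median is finite and informative even though the mean geodesic is not.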

Although it can be proven that in principle the Metropolis algorithm
yields convergence to the desired probability distribution, in practice
convergence may take an exceptionally large number of steps. For example,
if x is a high-probability graph, and its neighboring graphs x′ much less
probable, then the algorithm may retain x for a very large number of
steps. When this happens, we refer to x as a frozen structure, and to the
Markov chain as showing poor mixing (Gilks et al. 1996). These circumstances
are discussed further below.


Comparing Graphs and Graph Distributions 


Our measures of global structure can be used to describe any of our
sampled graphs, but it is difficult to interpret them in the absence of a
basis for comparison. For instance, on what basis do we judge a median
geodesic length as “short”? In the most general sense, there are a number
of possible comparisons that could be made. Pattison et al. (2000) illustrated
how to draw judgments about global structure through comparing
a graph of interest against a range of different graph distributions of
increasing complexity. Here, however, we make comparisons against a
relevant Bernoulli graph distribution, because as explained above, we
expect graphs in that distribution generally to have short average geodesics
and low clustering.

18 We depict graphs using Pajek (Batagelj and Mrvar 2002).

To consider the structure of an individual graph, our approach is as
follows: we simulate a comparative Bernoulli distribution of graphs. We
then examine the graph’s clustering coefficient C against the distribution
of clustering coefficients from the Bernoulli sample. If C is extreme in
that distribution (that is, if it is greater than the ninety-fifth percentile),
we say that the graph is highly clustered. Similarly, we say that the graph’s
median geodesic (G50) is short if it is not extreme compared to the
distribution of median geodesics from the Bernoulli sample, that is, if it is
less than the ninety-fifth percentile. We can define short first-quartile
geodesics (G25) and short third-quartile geodesics (G75) similarly. We then
define a small world graph as having a short G50 and high clustering.¹⁹

We need to determine the appropriate comparative Bernoulli distribution.
We simulate the Markov graph distribution and calculate the mean number
of edges from the sample. We then simulate a Bernoulli distribution with
the same expected number of edges. We can then assess each graph in the
Markov sample against the Bernoulli sample as in the previous paragraph
and determine, for instance, the percentage of small world graphs (noting
that in the Bernoulli sample no more than 5% of graphs can be small world
by definition). This enables us to assess whether the local
parameterization tends to result in small world graphs or some other
structure. At the same time, a comparison of degree distributions across
the two graph distributions is helpful in understanding the graph
structure, as will be seen below.
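The comparison procedure described above can be sketched in a few lines of
Python. This is only an illustrative sketch, not the code used for the
article: the function names are hypothetical, the clustering coefficient is
assumed to be three times the number of triangles divided by the number of
two-stars, geodesic quartiles are assumed to be taken over all dyads with
unreachable dyads counted as infinite, and a G50 equal to the cutoff is
treated as short.

```python
import random
from collections import deque
from itertools import combinations
from math import comb


def bernoulli_graph(n, p, rng):
    """Adjacency sets of a Bernoulli graph: each dyad is tied with probability p."""
    adj = {i: set() for i in range(n)}
    for i, j in combinations(range(n), 2):
        if rng.random() < p:
            adj[i].add(j)
            adj[j].add(i)
    return adj


def clustering(adj):
    """Assumed definition: 3 * triangles / two-stars (0 if there are no two-stars)."""
    two_stars = sum(comb(len(nb), 2) for nb in adj.values())
    # Each triangle is found once from each of its three nodes: 3 * triangles.
    closed = sum(1 for nb in adj.values()
                 for a, b in combinations(nb, 2) if b in adj[a])
    return closed / two_stars if two_stars else 0.0


def geodesic_quantile(adj, q):
    """Quantile q of geodesic lengths over all dyads (unreachable = infinite)."""
    lengths = []
    for s in adj:
        dist = {s: 0}
        queue = deque([s])
        while queue:  # breadth-first search from s
            u = queue.popleft()
            for v in adj[u]:
                if v not in dist:
                    dist[v] = dist[u] + 1
                    queue.append(v)
        lengths.extend(dist.get(t, float("inf")) for t in adj if t > s)
    lengths.sort()
    return lengths[int(q * (len(lengths) - 1))]


def percentile95(values):
    """Simple order-statistic estimate of the ninety-fifth percentile."""
    ordered = sorted(values)
    return ordered[int(0.95 * (len(ordered) - 1))]


def reference_cutoffs(n, mean_edges, n_graphs, rng):
    """Cutoffs for clustering and G50 from a matched Bernoulli sample."""
    p = mean_edges / comb(n, 2)  # same expected number of edges
    sample = [bernoulli_graph(n, p, rng) for _ in range(n_graphs)]
    return (percentile95([clustering(g) for g in sample]),
            percentile95([geodesic_quantile(g, 0.5) for g in sample]))


def is_small_world(adj, c_cut, g50_cut):
    """High clustering (above the cutoff) and a short G50 (not above the cutoff)."""
    return clustering(adj) > c_cut and geodesic_quantile(adj, 0.5) <= g50_cut
```

In use, one would call `reference_cutoffs` with the mean number of edges
observed in the Markov sample and then apply `is_small_world` to each Markov
graph. Pure Python is slow for samples of the size used in this article, so
a smaller number of graphs is advisable when experimenting.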


Frozen Patterns 


As noted above, for some choices of parameter values the simulation may
reach a particular high-probability graph that remains as the current
graph for a large number of steps in the simulation. Such behavior is
common as the size of the parameters increases. Within the context of
general pattern theory, Grenander (1993) described such behavior as
freezing and posed what he termed the first limit problem: How does the
probability distribution behave as the interactions among variables are
made stronger? In the context of model (3), the question becomes one of
characterizing the random graph distribution as the parameters become
larger in absolute value. As Grenander demonstrated, scaling the
parameters by an increasingly large constant positive value 1/y inevitably
leads to freezing. In particular, as the parameters become larger (small
values of y), the probability density becomes concentrated uniformly on a
subset of structures having minimum energy.20 Such structures are often
very regular in form and are referred to as frozen patterns. Conversely,
as y increases in magnitude, the parameters of the model become very small
and the graph distribution approximates a Bernoulli distribution. In the
context of statistical mechanics, the parameter y can be regarded as
temperature, hence the description of patterns associated with small
values of y as frozen. Frozen patterns often occur in “degenerate” regions
of the parameter space, as discussed in the next section.


19 One could, of course, adopt more or less stringent versions of these
criteria. For instance, Watts (1999a) investigated caveman graphs with
clustering coefficients close to one. We show such examples below. For the
most part, however, we do not insist on clustering coefficients close to
one but adopt the traditional statistical approach of investigating
extremes in a comparison distribution, in this case the distribution of
Bernoulli graphs. Of course, in a long enough simulation, a Bernoulli
graph distribution will produce a (small) number of highly clustered
graphs, so adopting the extreme 5% criterion is not unreasonable.

The presence of freezing may be examined in several ways, for instance,
by observing low variance in the degrees of individual nodes across the
iterations, or by tracking the number of times a candidate graph is
accepted by the simulation algorithm, since freezing is characterized by a
very small acceptance rate.
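The acceptance-rate diagnostic can be illustrated with a minimal Metropolis
sketch for a model with edge, two-star, three-star, and triangle
parameters. It is offered under stated assumptions rather than as the
implementation used here: the function names are hypothetical, each step
proposes toggling one randomly chosen dyad, the chain starts from the empty
graph, and the argument `y` divides all parameters in the manner of the
temperature scaling discussed above.

```python
import math
import random


def change_stats(adj, i, j):
    """Change in (edges, two-stars, three-stars, triangles) from adding tie
    i-j to the graph with that tie absent."""
    di = len(adj[i]) - (j in adj[i])  # degree of i ignoring the tie i-j
    dj = len(adj[j]) - (i in adj[j])
    shared = len(adj[i] & adj[j])     # common partners close new triangles
    return (1, di + dj, math.comb(di, 2) + math.comb(dj, 2), shared)


def metropolis(n, params, steps, rng, y=1.0):
    """Toggle one random dyad per step; return final graph and acceptance rate."""
    adj = [set() for _ in range(n)]   # empty starting graph (an assumption)
    accepted = 0
    for _ in range(steps):
        i, j = rng.sample(range(n), 2)
        present = j in adj[i]
        stats = change_stats(adj, i, j)
        log_ratio = sum(p * s for p, s in zip(params, stats)) / y
        if present:
            log_ratio = -log_ratio    # removing the tie reverses the change
        if log_ratio >= 0 or rng.random() < math.exp(log_ratio):
            if present:
                adj[i].discard(j)
                adj[j].discard(i)
            else:
                adj[i].add(j)
                adj[j].add(i)
            accepted += 1
    return adj, accepted / steps
```

A returned acceptance rate close to zero over a long run is the symptom of
freezing described above. In practice the rate would be computed after a
burn-in period, and repeated runs from different starting graphs would be
compared.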


Model Interpretation and the Importance of the Three-Star Parameter 


We have discussed above a substantive interpretation of the triangle
parameter as expressing the outcomes of a local social process relating to
structural balance.21 The edge parameter represents a baseline tendency
to form social ties. The star parameters are somewhat more complex. The
two-star parameter may be interpreted as a tendency to have multiple
network partners, as indeed may the three-star parameter. What we present
here are models with positive two-star and negative three-star parameters.
Our substantive interpretation of this parameter pattern is that actors
tend to want multiple network partners (positive two-star) but experience
a cost in having too many (negative three-star). It should be noted that
higher-order stars contain lower-order stars, so that a node with degree
four, for instance, is at the center of six two-stars and four
three-stars. Accordingly, there comes a point in the degree distribution,
determined by the relative values of the two- and three-star parameters,
at which a tendency for additional partners moves from positive to
negative.


20 Define the energy of a graph x as $H(x) = -\sum_A \lambda_A z_A(x)$ and
the minimum energy m as the minimum value of H(x) over all graphs in the
state space. If K denotes the set of graphs with minimum energy m and 1/y
(y > 0) is a scaling constant for parameters, then a scaling of the
parameters of model (1) yields the model given by
$\mathrm{prob}(X = x) = (1/\kappa) \exp\{\sum_A (\lambda_A / y)\, z_A(x)\}$.
This model can be rewritten in the form

$$\mathrm{prob}(X = x) = \frac{\exp\{-(H(x) - m)/y\}}{|K| + \sum_{x' \notin K} \exp\{-(H(x') - m)/y\}},$$

from which it is clear that as y becomes smaller (and the model parameters
become larger), the probability of graphs in the minimum energy set K
approaches 1/|K| and the probability of all other graphs approaches zero.


21 We want to be clear that we are not interpreting the Metropolis
algorithm as instantiating such processes; rather, that algorithm is used
to simulate graph distributions with certain properties. These properties
can be interpreted as the outcomes of (possibly unobserved) social
processes. An exponential random graph distribution may be construed as
the stationary distribution of structures emergent from more overtly
process-based models (Snijders 2001, 2002).
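The star counts and the turning point in the degree distribution can be
made concrete with a small sketch. The function names are hypothetical, and
the calculation is deliberately partial: it looks only at the star terms
centered on a single node and ignores the edge and triangle parameters as
well as the stars centered on the partner at the other end of the new tie.

```python
from math import comb


def stars_centered_at(degree, k):
    """Number of k-stars centered on a node of the given degree."""
    return comb(degree, k)


def marginal_star_weight(degree, sigma2, sigma3):
    """Change in the star terms at one node when its degree rises by one."""
    # comb(d + 1, 2) - comb(d, 2) = d, and comb(d + 1, 3) - comb(d, 3) = comb(d, 2)
    return sigma2 * degree + sigma3 * comb(degree, 2)


def first_costly_degree(sigma2, sigma3, max_degree=99):
    """Smallest current degree at which one more partner lowers the star terms."""
    for degree in range(max_degree + 1):
        if marginal_star_weight(degree, sigma2, sigma3) < 0:
            return degree
    return None  # no turning point within the range examined
```

For example, `stars_centered_at(4, 2)` and `stars_centered_at(4, 3)`
reproduce the six two-stars and four three-stars mentioned above, and
`first_costly_degree` locates the degree at which a positive two-star
parameter is outweighed by a negative three-star parameter.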

Models with three-star parameters are helpful in constructing models that
yield even rough approximations to observed networks. If the highest-order
star parameter in the model is sufficiently positive, the complete graph
will be a minimum energy graph, and the simulation will move toward the
complete graph and stay there with very high probability. Such models have
been termed “degenerate” by Handcock (2003).22

Likewise, if the highest-order star parameter is sufficiently negative,
then the null graph will be a minimum energy network, and the simulation
will gravitate toward the null graph. For example, if the only star
parameter in the model is for a two-star, then the null graph will be a
minimum energy graph for sufficiently negative values of $\sigma_2$, and
the complete graph will be a minimum energy network for sufficiently
positive values of $\sigma_2$. Indeed, the range of values of $\sigma_2$
for which probability is widely dispersed across graphs in the state space
may be quite small. For example, in figure 2, we show graph statistics for
graphs in the distribution for the model with a fixed edge parameter
$\theta$ and varying two-star parameter $\sigma_2$. It is clear from these
figures that while the effect of $\sigma_2$ is dependent on the value of
$\theta$, there is a relatively small range of parameter values for which
the corresponding graph distributions have average densities in other than
the very low or very high range (see also Park and Newman 2004). Thus,
models with just an edge and a two-star parameter may be too simple to
capture some important characteristics of observed networks (such as
some simple aspects of connectedness). Of course, the behavior of the
models changes with the addition of a triangle parameter, but many of
the same features persist.

Accordingly, we have included a nonzero three-star parameter in all of
the models that we simulate below. (It is worth noting that stars are
related to the degree distribution, so that to model up to three-stars is
in effect to model the first three moments of the degree distribution.)
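The link between star counts and moments of the degree distribution can be
written out explicitly. The notation is introduced here for illustration
only: $d_i$ is the degree of node $i$, $L$ the number of edges, and $S_2$
and $S_3$ the numbers of two-stars and three-stars.

$$L = \tfrac{1}{2}\sum_i d_i, \qquad
S_2 = \sum_i \binom{d_i}{2} = \tfrac{1}{2}\sum_i d_i^2 - \tfrac{1}{2}\sum_i d_i, \qquad
S_3 = \sum_i \binom{d_i}{3} = \tfrac{1}{6}\sum_i d_i^3 - \tfrac{1}{2}\sum_i d_i^2 + \tfrac{1}{3}\sum_i d_i .$$

Each statistic is thus a linear combination of the first, second, and third
raw moments of the degree sequence, which is the sense in which a model with
edge, two-star, and three-star parameters addresses the first three moments.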


22 For this class of models, the issue of model degeneracy for certain
parameter values was first discussed by Strauss (1986). By model
degeneracy, Strauss (1986) meant situations where H(x) tended in
probability to a minimum as the number of nodes became large. In fact,
this notion of degeneracy relates to what Grenander (1993) terms the
second limit problem: How does the probability distribution behave as the
order of the graph (and hence the number of random edges in the model)
tends to infinity? In terms of Grenander's first limit problem, with fixed
numbers of nodes, Handcock (2002, 2003) has extensively investigated
degenerate parameter regions for two-star models for very small networks.


Choosing Parameter Values


In this article, we have chosen various parameter values based on our
exploration of the parameter space, simply to illustrate various points.
In the main, we seek to demonstrate that it is possible to achieve
properties akin to the small world with a few locally specified parameters
that can be interpreted in terms of local behaviors. More generally, we
wish to illustrate certain behaviors of the model under different
parameter conditions. Our results are not intended as claims to
universality, but rather to draw attention to possibilities, so in what
follows we have pragmatically chosen particular parameter values for these
purposes. The following, then, is not intended as an exhaustive
investigation of the parameter space, as has been done by Handcock (2002,
2003) for two-star models in very small networks. Ultimately, of course,
any investigation of whether particular parameter values are more likely
than others has to rest on empirical investigations of observed networks
in the context of careful model specification. In related work not
reported here (Robins 2003), our approach has been to estimate parameters
from empirical network data, using newly available Monte Carlo maximum
likelihood procedures, and then to simulate from these estimates using the
procedures in this article to investigate the global properties implied by
the empirical estimates.


SIMULATION RESULTS 
A Distribution of Small World Networks 


With our model parameterization, it is in fact not difficult to produce
small world graphs through a judicious choice of parameter values. So
we begin by presenting a distribution in which a substantial majority of
graphs satisfy our small world criteria, and then by adjusting the
parameter values we change the features of the distribution in ways that
are revealing. For these simulations, we are aiming to produce graphs on
100 nodes that have around 100 edges, so density is quite low,
approximately 2%.
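The density figure follows directly from the number of possible ties. As a
worked check (the symbols $n$ and $L$ for the numbers of nodes and edges are
introduced here for illustration):

$$\text{density} = \frac{L}{\binom{n}{2}} = \frac{100}{100 \cdot 99 / 2} = \frac{100}{4950} \approx 0.020 .$$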

The Markov random graph distribution we present is based on the
following vector of parameter values: (−4.0, 0.1, −0.05, 1.0), where the
values are for the edge, two-star, three-star, and triangle parameters,
respectively. The mean number of edges for graphs in this sample was
119.5; the sample from the comparable Bernoulli distribution had a mean
number of edges of 119.9. Table 1 presents the basic graph statistics for
the Markov random graph sample with those for the Bernoulli sample. We
see that the means of the basic statistics are similar, except for
triangles and hence for clustering.

                               TABLE 1
   GRAPH STATISTICS FROM MARKOV RANDOM GRAPH AND BERNOULLI SAMPLES

                         MARKOV RANDOM
                         GRAPH SAMPLE        BERNOULLI SAMPLE
                         Mean      SD        Mean      SD
Edges                    119.5     12.3      119.9     12.7
Two-stars                289       60        285       62
Three-stars              226       74        223       80
Triangles                6.2       3.1       2.3       1.7
Cluster coefficient      .06       .03       .02       .02


Of course, we could perform the usual null-hypothesis statistical tests
to determine whether there are significant differences between the means
in table 1, but at conventional alpha levels for two independent samples
of 4,500 each, we have rather excessive power, with a capability of
detecting even slight differences. It is more appropriate in these
circumstances to use a measure of effect size. There are various possible
measures of effect size that could be used. We present here the
differences between means scaled in terms of the standard deviation from
the Bernoulli graph distribution, $(\mu_M - \mu_B)/\sigma_B$. For the
means in table 1, the difference in means does not exceed 0.07 of a
standard deviation, except for the triangles, where the difference is 2.3
standard deviations, and for the clustering coefficient, where the
difference in means is 2.4 standard deviations. We infer that there is a
substantial effect for clustering in the Markov distribution.
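The effect sizes quoted above can be recomputed from the rounded entries
of table 1. The following is a minimal sketch with hypothetical names; the
clustering coefficient is omitted because the two-decimal values printed in
the table are too coarse to reproduce the reported figure.

```python
def effect_size(mean_markov, mean_bernoulli, sd_bernoulli):
    """Difference in means in units of the Bernoulli standard deviation."""
    return (mean_markov - mean_bernoulli) / sd_bernoulli


# (Markov mean, Bernoulli mean, Bernoulli SD), as read from table 1.
TABLE_1 = {
    "edges": (119.5, 119.9, 12.7),
    "two-stars": (289, 285, 62),
    "three-stars": (226, 223, 80),
    "triangles": (6.2, 2.3, 1.7),
}

EFFECTS = {name: effect_size(*row) for name, row in TABLE_1.items()}
```

With these inputs the triangle effect is about 2.3 Bernoulli standard
deviations, and the edge and star effects are all below 0.07 in absolute
value, in line with the text.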

In the Markov random graph distribution, 83% of graphs have short G25
(first quartile of geodesic distribution) and 82% have short G50 (the 95%
cutoff in the Bernoulli graph distribution is a median path length of
seven), but only 48% have short G75; 66% of these graphs have high
clustering, and in this sample 54% are small world according to our
criteria. What seems to be happening here is that the increased clustering
is not greatly affecting median path length (G50) but is at the cost of
increasing the “long” geodesics (i.e., above the third-quartile G75). This
is illustrated in figure 3, which compares median geodesics and
third-quartile geodesics for both distributions. The patterns of median
geodesics are not greatly different, but for the longer geodesics there
are substantially more infinite G75 in the Markov graph distribution. And
the comparison of degree distributions indicates that there are not great
differences between the two samples (figure 4).

Finally, figure 5 presents a representative graph from the Markov graph
distribution. This is a small world graph with 120 edges, a median path
length of five, and a clustering coefficient of 0.07. To the eye, the
graph does not appear greatly different from a Bernoulli graph (not
presented here); a couple of important features are a tendency for some
long cycles and plenty of nodes with degree of four, five, or more (in
line with figure 4). It is worth noting that many of the cycles intersect
one another. In terms of Watts’s (1999a) notion of shortcuts, two cycles
that jointly intersect create shortcuts for each other. This seems a
highly efficient way to create short paths. It is also worth noting that a
node that is at the intersection of two cycles must be at least a
four-star.

… graph sample and comparison Bernoulli graph sample (panels: Markov graph
distribution; Bernoulli graph distribution)

FIG. 4. Small worlds degree distribution for Markov graph and Bernoulli
samples. (NB: Dn refers to the number of nodes of degree n. Boxplots
indicate the range of Dn across the sample; outliers are excluded.)

FIG. 5. A small world graph


Long Path Networks 


We now present results for a simulation with parameters (−1.2, 0.05,
−1.0, 1.0). Compared to the previous example, the most important change
is to adjust the three-star parameter to be strongly negative, at the same
time as increasing the edge parameter to achieve a comparable density of
around 2%. We produce a Markov graph sample with a mean number of edges
of 123.6 (SD = 4.1). Not surprisingly, the graphs in this sample have
fewer stars, especially higher-order stars, compared with the Bernoulli:
for the Markov graph sample, the mean number of three-stars is 69.7
(SD = 10.0) compared to 252.3 (SD = 74.6) for the Bernoulli sample
(difference in means is 2.4 Bernoulli standard deviations). But despite
the positive triangle parameter, the mean clustering coefficient for the
Markov graph sample of 0.030 (SD = 0.02) is not substantially different
from the mean clustering coefficient of the Bernoulli sample (0.023,
SD = 0.02; difference in means is 0.4 of a standard deviation). According
to our criteria, less than 15% of the Markov graphs are highly clustered.
The difference in the number of stars is clearly illustrated by the
truncated degree distribution of the Markov graph sample (see figure 6).
In the sample, all have long first-quartile geodesics (G25) and 99.8%
have long median geodesics (G50), although interestingly, few of the
Markov graphs have infinite G75, whereas a substantial proportion of the
Bernoulli graphs do. Apparently, as the short and median geodesics tend to
be longer in the Markov graph sample, more nodes become connected, so that
there are fewer infinite geodesics. This is borne out by figure 6, where
the number of isolated nodes (D0) is lower in the Markov graph sample.

In figure 7, we present a graph from the Markov graph sample that
illustrates the effect of the longer G25 and G50. This graph has 122 edges
and is typical of the distribution, with low clustering coefficient and
long G25 and G50. The rather tight “circularity” of the graph in figure 5
still seems to be apparent, but closer inspection reveals that paths tend
to be more implicated in larger cycles. There are fewer isolated nodes
but, importantly, few stars of higher order than three. In the graph in
figure 7 there are only four four-stars, compared to 194 four-stars in the
graph in figure 5. As a result, there are many fewer mutually intersecting
cycles.

To illustrate the point further, we present in figure 8 a graph from a
distribution generated from parameter values (−2.2, 0.05, −2.0, 1.0). For
this model, we have made the negative three-star effect even stronger and
adjusted the edge parameter. Without presenting all the details, some 80%
of these graphs have long G50, and 9% have high clustering, with only
1% small world. It is clear that the graph with 94 edges in figure 8 has
long paths and zero clustering. We compare it with an example of a
Bernoulli graph with 96 edges in figure 9. At this lower density, we see
that the Bernoulli graph has itself a number of long paths, but the
higher-order stars continue to create several larger intersecting cycles.


A Local Process to Produce a Global Small World 


Of course, we are not claiming that these types of simulations represent
the only processes that can give rise to a small world. What we do wish
to illustrate is how locally specified effects may produce particular
global structures. Substantively, we argue that a network is likely to be
small world if the following conditions apply:

1. Individuals seek more than one network partner.

2. But the costs of maintaining many partners are high, so there is a
   tendency against a multitude of partners.

3. There is some tendency for network partners to agree about other
   possible partners (i.e., structural balance and clustering).

4. But this tendency is neither too strong (else the network becomes
   too cliquelike, with insufficient links between cliques for smaller
   geodesics, which we illustrate below) nor too weak (else there is
   little clustering in the network).

FIG. 6. Long paths degree distribution for Markov graph and Bernoulli
samples

FIG. 7. A graph with long median paths


The last condition might be interpreted in terms of Granovetter's (1973)
strong- and weak-ties argument. If all we have are strong ties, which tend
to cluster, then our “reach” across the network is limited.

In an organizational context, DeCanio, Dibble, and Amir-Atefi (2000)
argued that an efficient communication structure in a network can emerge
through mutually intersecting cycles, resulting from a tendency for
multiple partners combined with a cost against having too many partners.
They performed a series of simulations based on cost and reward functions
that in effect implemented such rules. This result does not seem to be
widely known among small world researchers. Of course, a shortcut as
defined by Watts (1999a) is a global feature (because one can never be
sure that a path is indeed a shortcut unless one knows about the structure
of the rest of the graph). But the basis of the simulations by DeCanio et
al. (2000), as embodied in their cost and reward functions, is locally
specified. The fact that these simulations produce shortcuts through
mutually intersecting cycles, in conjunction with our results reported
here, provides one possible local description of how short paths may
emerge.

FIG. 8. A long path graph with low clustering


A Denser Nonclustered “World” 


We now give an example of a denser network simulation, where what is
of interest is not so much geodesic length (because, as noted above, path
length tends to be short in denser graphs anyway). Rather, in this example
we wish to make some points about the degree distribution and also to
illustrate a tendency against clustering. We simulate for a strong
propensity for multiple partners, while retaining some cost against too
many partners, but with a large negative clustering effect, using the
parameter values (−3.2, 1, −0.3, −4.0). In this case, we have not fully
compensated with a large negative edge parameter to create very
low-density graphs. The strong tendency for two-stars results in a sample
with a mean number of edges of 371.5 (SD = 5.0), implying a mean density
of 7.5%, which is three to four times greater than in the previous
samples.23 The first point to note is that the negative triangle parameter
successfully removes most triangles from the Markov graphs, with the mean
number of triangles being 1.0 (SD = 1.0) compared to 67.9 (SD = 12.5) in
the comparative Bernoulli sample. The two degree distributions are
presented in the top two panels of figure 10. It is immediately apparent
that the Markov sample has a degree distribution that is truncated at both
ends compared to the Bernoulli sample.


23 At this density, distinctive features in the structure are not readily
discernible to the eye, so we do not present examples of actual graphs.

FIG. 9. A Bernoulli graph

Of course, this result in part arises from the restriction on the number
of nodes. Substantively, these parameter values can be interpreted to
describe persons who wish to have many network partners but have only
99 others from whom to choose. Hence we see very few low-degree nodes.
(In that sense, the effect of the actual parameter value is dependent on
the number of nodes, an issue we discuss further below.)

It is interesting to compare this result with the degree distribution for
a simulation without the negative triangle (negative $\tau$) effect, with
parameters (−3.2, 1, −0.3, 0.0). The sample for this distribution has a
mean number of edges of 383.5 (SD = 5.0), some two standard deviations
above that of the negative $\tau$ distribution, but has a mean number of
triangles of 52.7 (SD = 8.3), over six standard deviations above the
negative $\tau$ distribution. This is not surprising. What is interesting,
however, is to consider the degree distribution, which is presented in the
bottom panel of figure 10. While there are some differences between the
two Markov degree distributions, they nevertheless seem quite similar in
comparison to the Bernoulli distribution. An important point to make here,
then, is that knowledge of the degree distribution does not necessarily
give knowledge about clustering.

FIG. 10. Degree distributions for some denser graphs

The absence of triangles is not just a curiosity but has substantive
interpretation. There are real social networks where we see fewer
triangles than in random networks, for instance, networks of heterosexual
partners. What we have in the sample of negative $\tau$ networks are
graphs with hardly any triangles (three-cycles) but many four-cycles.24 We
might expect to see such patterns in promiscuous heterosexual networks,
with triangles perhaps more prominent in homosexual (and maybe bisexual)
networks. The balance between three- and four-cycles in a sexual network
may then relate to levels of multiple partnerships among and across actors
of different sexual orientation. Such outcomes have clear implications for
the understanding of HIV networks.

Sexual networks are a simple example of the possible relevance of
four-cycles as opposed to three-cycles. But they are not the only example.
In organizational contexts, for instance, four-cycles have important
theoretical implications in terms of generalized exchange and the building
of trust (Lazega and Pattison 1999).


RANDOMNESS AND STRUCTURE 


We introduced frozen patterns earlier as minimum energy graphs, and we
observed that there is a tendency for the simulation algorithm to become
“trapped” by such graphs. Although this poor mixing on the part of the
Metropolis algorithm is deleterious in terms of producing an adequate
simulation of the graph distribution, such regions of the parameter space
are often interesting in their own right, because they represent regular
structures. When such frozen patterns represent regular and stable
structural forms, a description of the forms that they take and the class
of models from which they are derived are of theoretical interest in their
own right. As we noted earlier, we can consider y as a temperature factor,
so that as the temperature increases, the model can move from a frozen
to a less regular structure.


From Caveman Graphs to Random Graphs 


Watts (1999a) introduced caveman graphs as part of his initial argument
about the importance of paths. These graphs comprise subsets of nodes
(the caves), which are fully connected within subsets but have no
connections between subsets (i.e., several complete components in the
graph, with no connections between components). A connected caveman graph
has a cyclic pattern connecting the caves. These two graph types are
clearly highly structured, and the chance of observing them in a Bernoulli
distribution of graphs is minuscule.


24 That is, cyclic paths that involve four nodes.

We present here a series of simulations with a fixed set of values that,
once scaled by a temperature factor y, become the parameter values. For
ease of visual presentation of graphs, we fix the number of nodes to 30
in this illustration. We select a vector $\lambda$ = (−3.2, 1, −0.3, 3.0);
these values are based on the previous example, except that here we need a
large and positive triangle value (necessary to create the caves).25 In
figure 11, we compare two graphs from simulations for y = 1 and y = 6. The
graph for y = 1 in the top panel of figure 11 is a highly regular caveman
structure, comprising two fully connected components of 15 nodes each. The
second graph, however, is a much less regular structure, as we explain
below.
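The temperature scaling used in these simulations amounts to dividing every
entry of the fixed vector by y. A minimal sketch, with hypothetical names
and the base values taken from the text:

```python
def scale_parameters(base, y):
    """Divide each base value by the temperature y (which must be positive)."""
    if y <= 0:
        raise ValueError("temperature y must be positive")
    return tuple(value / y for value in base)


# Edge, two-star, three-star, and triangle values before scaling.
BASE = (-3.2, 1.0, -0.3, 3.0)

# Parameter vectors at the temperatures discussed in this section.
SCALED = {y: scale_parameters(BASE, y) for y in (1, 4, 6, 10)}
```

Because every entry is divided by the same constant, the ratios between
parameters are unchanged; only their overall magnitude shrinks as y grows,
which moves the distribution toward a Bernoulli distribution.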

When y = 1, every sampled graph from our simulation has 210 edges
and clustering coefficient of one. Across the sample (after burn-in), the
degree of each vertex never changes (which means that no changes to the
graph are accepted for every 100 iterations). Clearly this simulation has
frozen, mirrored by the perfect transitive structure presented in figure
11. We have checked this simulation with a number of random starts to
ensure we have not found a local energy minimum. There are variations
depending on random starts, but in all cases two large caves resulted,
although there might be variation in the size of the caves, with the
number of nodes varying between 13 and 17, and possibly some isolates
appearing.26 As the temperature increases, some small variation begins to
appear. For instance, by the time y = 4, a mean of 2.5 changes to the
graph are accepted for every 100 iterations, so the structure is not
completely frozen. In the y = 4 sample the mean number of edges is 205
(SD = 2), and the mean clustering coefficient is 0.97 (SD = 0.01). The
median G50 is infinite. Yet, at some point between y = 4 and y = 6 we see
a phase transition, as the structure melts. (At y = 5, for instance, much
longer runs are required to burn in to a highly regular two-cave
structure, indicating closeness to a transition.)
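The edge count and clustering coefficient reported for the frozen y = 1
structure can be checked by constructing the two-cave graph directly. This
is a sketch with hypothetical function names, and it assumes that the
clustering coefficient is three times the number of triangles divided by
the number of two-stars.

```python
from itertools import combinations
from math import comb


def caveman_edges(cave_sizes):
    """Edge set of a caveman graph: complete caves with no ties between them."""
    edges = set()
    start = 0
    for size in cave_sizes:
        edges.update(combinations(range(start, start + size), 2))
        start += size
    return edges


def global_clustering(edges):
    """Assumed definition: 3 * triangles / two-stars."""
    adj = {}
    for i, j in edges:
        adj.setdefault(i, set()).add(j)
        adj.setdefault(j, set()).add(i)
    two_stars = sum(comb(len(nb), 2) for nb in adj.values())
    # Each triangle is found once from each of its three nodes.
    closed = sum(1 for nb in adj.values()
                 for a, b in combinations(nb, 2) if b in adj[a])
    return closed / two_stars if two_stars else 0.0


TWO_CAVES = caveman_edges([15, 15])
```

Two caves of 15 nodes give 2 × 105 = 210 edges and a clustering coefficient
of one, matching the values reported for the frozen sample.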

At y = 6, however, the distribution shows no signs of being frozen at
all. The graph in the lower panel of figure 11 has 180 edges and a
clustering coefficient of 0.44. Across the sample, the mean number of
edges is 173.0 (SD = 5.7), and the mean clustering coefficient is 0.427
(SD = 0.02). In this simulation, an average of 65 changes to the graph are
accepted for every 100 iterations. And the median G50 is now two. Yet this
is not quite a totally random graph distribution. A comparable Bernoulli
graph sample indeed has a median G50 of two, but has a mean clustering
coefficient of 0.397, over one standard deviation below that of the y = 6
sample. So a large proportion (94%) of graphs in the Markov sample are
more highly clustered than the Bernoulli graphs and have similar median
path lengths; that is, they are small world graphs by our criteria. But by
the time we increase the temperature to y = 10, there is little to
differentiate the sample from a comparable Bernoulli sample.27 For
instance, the mean clustering coefficient is 0.42 (SD = 0.02) compared to
0.41 (SD = 0.03) for the comparable Bernoulli sample, with median G50
remaining at two for both samples.


25 The size of the triangle parameter, relative to the other parameter
values, determines the number and size of the caves. As shown in figure
11, the value of three chosen here produces two caves when y = 1, whereas
if we had chosen a value of two, the result would be three caves.


26 We have similarly checked other simulations described in this section
with multiple starts to reach consistent conclusions.

This result concurs with those of Watts (1999a), in that at a certain
temperature a small addition to “randomness” (in our case, a small
increase in temperature) pushes a structured graph distribution with long
median geodesics into a less regular distribution with frequent small
world graphs. The ratios of the parameter values are of course retained as
temperature changes, but these ratios determine the frozen structure. So
the particular ratios chosen here lead to a clustered caveman world for
low temperatures, whereas for our first small world example above,
decreasing the temperature results in an empty graph, principally because
in that case we chose parameters to produce a low-density graph.

We have experimented with various parameter ratios, and with judi- 
cious choices we can produce various structural forms at low tempera- 
tures In figure 12 we show some interesting examples that involve neg- 
ative triangle parameters, producing cyclic structures of various types In 
the top panel of the figure we have a structure of cycles produced from 
the parameter vector (—5 0, 5 0, —10,—6 0) The first point to note 1s that 
this structure has four structural equivalence classes and could be sim- 
plified into a perfectly fitting block model (White, Boorman, and Breiger 
1976) There are no within-block ties, and each node 1s tied to every other 
node ın two other blocks This structure can be construed as a frozen 
generalized exchange structure, highly reminiscent of Bearman’s (1997) 
cycle of marriage exchange among residents of Groote Eylandt (except 
that the marriage cycle had eight equivalence classes) By adjusting the 
parameters slightly, with a parameter vector (—4 2, 10, —0 05, —6 0), we 


27 Although the Bernoulli sample does seem to have somewhat wider dispersion for
the statistics.

FIG. 12. Some frozen graph structures. Top panel: cyclic structures with four structural
equivalence classes. Bottom panel: complete bipartite graph.


produce the graph in the bottom panel of figure 12, a complete bipartite
graph.


CONCLUSIONS 


In this article, we have (1) shown how to simulate a distribution of Markov
random graphs based on assumptions about simple local social processes,
(2) developed a method to examine the resulting global structures by
comparison with an appropriate Bernoulli distribution of graphs, (3) provided
examples of various stochastic global worlds that may result, including
small worlds, long path worlds, and dense nonclustered worlds
with many four-cycles, (4) suggested one locally specified social process
that may result in small world global properties, (5) shown how parameter
scaling relates to the movement from structure to randomness, with a
phase transition occurring at a certain scaling (temperature) so that a
highly regular structure melts into a less regular form, and (6) provided
examples of frozen structures, including highly clustered caveman graphs,
bipartite structures, and global cyclic structures involving structurally
equivalent groups.

There are some important points to make that bear on future work.
First of all, our methods are deliberately based on small-scale fixed-node
networks, which constitute a large proportion of the empirical work involving
social network analysis. Using these methods, an empirically observed
network can be compared to an appropriate Bernoulli graph distribution
to investigate levels of clustering, geodesic length, and degree
distribution, permitting an examination of global structure. The methods
could be further built on by the comparison of an observed network
against a range of graph distributions with given properties expressing
certain structural hypotheses, as in Pattison et al. (2000). Such empirical
examination need not be confined to unimodal networks: we are currently
studying global structures of interlocking directorships through similar
approaches to bipartite graphs (Robins and Alexander 2004). On the other
hand, to consider local structure, one might fit an exponential random
graph (p*) model to the network, using recently developed Monte Carlo
maximum likelihood techniques (Handcock 2002, 2003; Snijders 2002).²⁸
Even so, model specification remains an important issue, especially with
the need to avoid degenerate regions of the parameter space. Snijders et
al. (manuscript) propose new specifications for exponential random graph
models that go beyond Markov graph parameterizations. These specifications
assist in avoiding degeneracy when fitting models to empirical
network data.

But these methods open additional possibilities for model examination:
by fitting models and then producing a distribution of graphs as described
above, a researcher can investigate whether the model is successful in
reproducing aspects of the observed global structure. If the global structure
is not successfully reproduced, then it may be that model specification
should be reconsidered. In our introduction, we argued that observation,
modeling, and simulation should not be seen as separate, but rather as a


28 Approximate pseudolikelihood techniques (Strauss and Ikeda 1990), previously a
standard approach, should now be relied on only when maximum likelihood is not
feasible.

package of complementary tools. To examine simulated global structure
from a model fitted to an observed network may be a demanding test of
model specification,²⁹ but it clearly illustrates the value of combining these
approaches.

Even though our focus has been on fixed-node networks, the effect of
given parameter values for networks with different numbers of nodes is
an important issue. This is a matter for ongoing work. There are conceptual
problems here, however, as the number of nodes becomes increasingly
large. The Markov random graph dependence assumption imposes
dependency among all possible ties from one actor to all other actors.
This seems to us, ultimately, to be untenable on substantive grounds:
individuals do not “know” all other actors in a large social network and
may have no plausible opportunity to meet some “distant” actors. In what
sense can such highly unlikely ties be said to influence possible ties to
those who are much more socially “proximate”? It is for such reasons that
Pattison and Robins (2002) introduced the notion of overlapping social
settings as “social locales” within which dependency amongst ties can come
into being. In this case, examination of node scale-up issues may not
require extension to graphs of indeterminate size.

There is much to be done on parameter scale-up and the transition
from less to more regular structures. The more regular structures are those
on which much of social network analysis has focused. We have presented
some simple examples here, but there are more complex possibilities: in
a multiple network framework, Pattison (2002) showed how freezing a
stochastic model can result in classic balance and strong-weak tie
descriptions.

We began by referring to the global structure of the Medici network
as the outcome of local social processes. Clearly, the Medici were particular
in that network, and Padgett and Ansell make clear why they were historically
special. In the end, our models may not be able to rest on node
homogeneity. The qualities of the actors are likely to count. It is in the
interplay of actor attributes and network ties that more realistic local
social processes will be described. Thus, an important next step is to extend
the approach described here to social selection models (Robins, Elliott,
and Pattison 2001), to social influence models (Robins, Pattison, and Elliott
2001), and to dynamic conjunctions of the two. It remains to be seen
exactly what the implications of these extensions will be for global structure,
but it is likely that such a move will have the added benefit of
modeling the greater dispersion of degrees typically seen in empirical
networks compared to homogeneous random graphs (Albert and Barabási
2002; Pattison and Wasserman 2002).


29 Although not an impossible one; see Robins (2003).

The authors explore the evolution of the structure and performance
of a social network in a population of individuals who search for
local optima in diverse and dynamic environments. Individuals
choose whether to innovate or imitate, and in the latter case, from
whom to learn. The probabilities of these possible actions respond
to an individual’s past experiences using reinforcement learning.
Among some of the authors’ more interesting findings is that a
population’s performance is not monotonically increasing in either
the reliability of the communication network or the productivity of
innovation.


INTRODUCTION 


The acceleration of scientific progress in 17th-century Western Europe is
often attributed to the founding of learned societies and to improved
communication among contemporary researchers: “Here was a widely
dispersed population of intellectuals, working in different lands, using
different vernaculars, and yet a community. What happened in one place
was quickly known everywhere else, partly thanks to a common language
of learning, Latin, partly to a precocious development of courier and mail
services, most of all because people were moving in all directions. In the
seventeenth century, these links were institutionalized in the form of
learned societies with their corresponding secretaries, frequent meetings,
and periodical journals” (Landes 1998, pp. 204-5). The emergence of such
formal and informal networks of scientists, triggered by improvements in
communication technology and transportation, marked the beginning of
the institutionalization of scientific investigation which eventually paved
the way to the Industrial Revolution.

The latter half of the 20th century has seen another major explosion
in the formation of such networks and informal communities. Thanks to
the advent of the Internet and the World Wide Web, the time delay in
acquiring information has shrunk from weeks (or at best, days and at
worst, months) to minutes. In the communities of research scientists, learning
what others are working on and what methods they are deploying
previously required mailing letters requesting working papers, frequent
trips to professional meetings and workshops, and laborious search in the
dust-covered library stacks.² All this has changed dramatically in recent
years. A letter sent using the postal service is replaced with an e-mail
which arrives at the desktop of the other party within seconds, or by
directly downloading the paper from the individual’s Web site. What is
more, there exist ongoing projects that facilitate dissemination of research
over the Internet via a decentralized database of working papers, journal
articles, and software components.³ Downloading a working paper is now
merely a matter of pointing a cursor and clicking. The library, a physical
institution that was once the hallmark of academic community, is gradually
disappearing into the network of connected computers which allow
us to search, check out, and read material at our desktop PCs.

The 300 years of separation notwithstanding, these two episodes share
a common element: the prominent role played by the social learning network
in achieving and sustaining scientific progress at the macrolevel
from the chance occurrences of local innovations scattered across wide
geographic areas. Furthermore, these issues are not unique to scientific
processes but also apply to markets and organizations, for what is at work


2 See Liberman and Wolf (1997), who investigate knowledge flows from conferences.


3 RePEc (Research Papers in Economics) at http://repec.org, an international collaborative
project in economics, links various sources of research publications in over 30
countries and enables their distribution via electronic media. As of November 2002,
they report that the “RePEc database holds over 176,000 items of interest, over 85,000
of which are available online.” Also, as of November 2002, SSRN (Social Science
Research Network) at http://www.ssrn.com claims to have in its eLibrary an abstract
database of over 44,900 working papers and forthcoming papers as well as an electronic
paper collection of over 25,400 downloadable full-text documents.
is quite fundamental: finding better solutions to a stream of problems.⁴
Given this relationship between local innovations and social learning via
community networks, what are the implications for an individual’s decision
to engage in independent innovation or to access the social network
for observing the ideas of others? If the networks themselves are an outcome
of interactive choices among individuals as to whom to observe and
whom to ignore, what are the determinants of their emergent structure?
Will an improvement in communication technology be sufficient to generate
superior performance at the individual and the community level, or
are there other complementary factors essential for a social network to
realize these potential gains? Our objective is to explore these issues by
building an agent-based computational model of an evolving social network
with potentially innovative individuals and analyzing its emergent
structure as well as its long-run performance.

Given the richness of these processes, the model we develop is stylized
and is designed to encompass three generic features of these knowledge
processes. First, agents are faced with problems to solve, which we model
as an effort to achieve a local optimum in the space of possible things
that one can do.⁵ This optimum is uniquely defined for each individual,
where similarity among optima is to be interpreted as similarity in the
problems being solved. Second, agents must choose how to allocate their
effort between individual learning (innovation) and social learning (imitation).
When they engage in learning from others, an agent decides from
whom to learn, which takes the form of establishing links in a social
network. These choices are made probabilistically, and the probabilities
are adjusted over time based on personal experience. This modeling structure
allows us to examine the emergent structure of the social network
in terms of how observation probabilities are distributed across individuals
as well as to track the evolving choice between innovation and imitation
for each individual. Our model also captures the fact that the


4 See Podolny, Stuart, and Hannan (1996) for an analysis of knowledge production via
a network in the semiconductor industry.


5 In defense of this view, we appeal to Kuhn’s (1962) interpretation of “normal science
as puzzle-solving” in which a scientist is someone driven by the desire to search for a
solution (a goal yet unknown) to the problem at hand: “Bringing a normal research
problem to a conclusion is achieving the anticipated in a new way, and it requires the
solution of all sorts of complex instrumental, conceptual, and mathematical puzzles.
The man who succeeds proves himself an expert puzzle-solver, and the challenge of
the puzzle is an important part of what usually drives him on” (Kuhn 1962, p. 36).
actual outcomes of these efforts tend to have a substantial random aspect.⁶
Third, knowledge creation and diffusion occurs in the context of a changing
environment as represented by stochastic movement in optima. This
feature captures the fact that individuals do not typically solve one problem
once and for all, but instead must solve a sequence of problems which,
to varying degrees, can at least partially be solved using solutions to
previous problems.

This model is used to explore how the innovativeness of individuals
and the reliability of the communication technology impact network structure
and performance, and how those relationships depend on the characteristics
of the environment. Several interesting properties emerge.
When communication technology is sufficiently poor, the results are quite
intuitive. For example, performance is improved by enhancing the reliability
of communication and making agents more productive in innovation.
However, when communication is sufficiently efficacious, further
enhancement of it can actually be detrimental to performance. The intuition
is that the increased social learning from a more reliable network
leads to local homogenization of agents in terms of their solutions to
problems. This lack of diversity within the social network results in a
population of agents who are ill equipped to adapt to a changing environment.
Second, when reliability is of moderate quality, a rise in the
productivity of innovation can be deleterious to average performance in
the population. The key insight here is that imitating others is both a
social good and a social detriment. Engaging in imitation fails to add to
the stock of knowledge, but it does serve to spread worthwhile ideas.
Thus, an agent who imitates is improving the value of the network, which
can enhance collective performance. When network reliability is moderate,
agents may be engaging in too much individual learning; from a
societal perspective, it might be better to tap into the network to pass
along ideas and develop more useful links (which can only be achieved
through experience). Making agents more innovative induces them to use
the network even less and thereby exacerbates the problem of a poorly
developed network and inadequate sharing of ideas. A third property is
that when network reliability is low, the social network is less structured
when agents are more innovative. The simple reason is that they choose


6 See Mokyr (1990, p. 152): “Over most of human history, technological change did not
take place, as it does today, in specialized research laboratories paid for by research
and development budgets and following strategies mapped out by corporate planners
well-informed by marketing analysts. Technological change occurred mostly through
new ideas and suggestions occurring if not randomly, then certainly in a highly unpredictable
fashion. Demand conditions may have affected the rate at which these
ideas occurred, and may have focused them in a particular direction, but they did not
determine whether a society would be technologically creative or not.”




Discovery and Diffusion of Knowledge 


to engage in more individual learning. Since they then access the network
less, they are less effective in developing worthwhile links. When network
reliability is high, however, those two measures (the structure of the social
network and the capacity for innovation) move together. This is driven
by the fact that the quality of a network, in terms of the population’s
possessing a diverse array of useful ideas, rises with the productivity of
innovation. But that quality cannot be adequately tapped if reliability is
low. A more structured network emerges when both the reliability and
the quality (which depends on the capacity for innovation) of the network
are high. While innovation and imitation are substitutes at the individual
level, they are complements at the population level through the mechanism
of social learning.


Related Work 


While there is a wealth of research exploring the implications of a network,
there is much less formal theoretical work that endogenously derives social
learning networks. An early exception is the work of Carley (1990, 1991).
She develops a simulation model in which interagent interaction leads to
shared knowledge, which then determines the likelihood of further interactions.
Her model differs from ours in two crucial ways. First, she
concentrates purely on the social network as the learning mechanism;
individuals do not innovate. The dynamic behavior of the network is then
solely driven by the endowed differences in knowledge across individuals.
Second, she specifies the likelihood of interaction between two agents as
being determined directly by the amount of information they share: the
larger the set of common information, the higher the probability that they
will interact again. In contrast, our model takes a more bottom-up approach.
We assume that the agents are motivated by private goals. For
each agent, the probability of observing another agent is adjusted on the
basis of the past value of other agents’ information in attaining her goal.
This approach highlights our perspective that the likelihood of an interaction
between two individuals is an emergent outcome and should be
modeled as such.

More recent work is based on the specification of goals for agents, and
it is the striving for attainment of these goals (“utility maximization”) that
determines a network’s structure. Jackson and Wolinsky (1996) characterize
the properties of a stable network (one in which no agent wants
to create or destroy links), while the dynamics of network formation are
explored in Bala and Goyal (2000), Hummon (2000), and Jackson and
Watts (2002). They assume agents adaptively adjust their networks so as
to improve performance. Though utilizing a distinct approach, a predecessor
to these discussions is Huberman and Hogg (1995), but it differs
from our model in several substantive ways. First, the payoff to an agent
from a given network is exogenously fixed, while in our model, it evolves
with what an agent can learn from other agents. Second, it does not
consider an agent’s choice of innovation versus imitation, but focuses
exclusively on network structure. Third, Huberman and Hogg’s study
does not allow agents to adapt their search process in response to experience,
while in our model, this is done through reinforcement learning.


THE MODEL 
Agents, Tasks, Goals, and Performance 


Consider a social system consisting of $M$ individuals. Each individual
$i \in \{1, 2, \ldots, M\}$ engages in an operation that can be broken down into
$N$ separate tasks. There are several different methods that can be used
for each task. The method chosen by an agent for a given task is represented
by a sequence of $d$ bits (0 or 1) such that there are $2^d$ possible
methods available for each task. In any period $t$, an individual $i$ is then
fully characterized by a binary vector of $N \cdot d$ dimensions. Denote it by
$z_i(t) \in \{0, 1\}^{Nd}$, so that $z_i(t) = (z_i^1(t), \ldots, z_i^N(t))$ and
$z_i^h(t) = (z_i^{h,1}(t), \ldots, z_i^{h,d}(t)) \in \{0, 1\}^d$ is individual $i$’s chosen method in task $h \in \{1, \ldots, N\}$.
An example with $N = 24$ and $d = 4$ is given below.


task ($h$):                    #1     #2     #3     ...    #24
task methods ($z_i^h(t)$):     1101   0010   1001   ...    1110


Note that there are 16 ($= 2^4$) different methods or bit configurations for
each task. What is shown above for a given task represents a particular
method chosen out of the 16 available methods. Given that the operation
is completely described by a vector of 96 ($= 24 \times 4$) bits, there are then
$2^{96}$ possible bit configurations for the overall operation.

In measuring the degree of heterogeneity between two methods vectors,
$z_i$ and $z_j$, we shall use “Hamming distance,” which is defined as the number
of positions for which the corresponding bits differ:

$$D(z_i, z_j) = \sum_{h=1}^{N} \sum_{k=1}^{d} \left| z_i^{h,k} - z_j^{h,k} \right| \tag{1}$$
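As an illustration of equation (1), the following is a minimal sketch, assuming a methods vector is stored as a flat list of $N \cdot d$ bits. The function name `hamming_distance` and the example vectors are hypothetical and do not come from the article.

```python
from typing import Sequence


def hamming_distance(z_i: Sequence[int], z_j: Sequence[int]) -> int:
    """Count the positions at which two equal-length bit vectors differ."""
    if len(z_i) != len(z_j):
        raise ValueError("methods vectors must have the same length N * d")
    return sum(a != b for a, b in zip(z_i, z_j))


# Hypothetical vectors with N = 2 tasks and d = 2 bits per task.
z_a = [1, 1, 0, 1]
z_b = [0, 1, 0, 0]
distance = hamming_distance(z_a, z_b)  # the vectors differ in two positions
```

Because the double sum in equation (1) runs over every bit of every task, flattening the vector and comparing position by position gives the same count.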


Each individual possesses a goal vector which may be different from
one period to the next. Let $\hat{z}_i(t) \in \{0, 1\}^{Nd}$ be the goal vector of agent $i$ in
period $t$. That $\hat{z}_i(t)$ can differ across agents implies diversity in agents’
problems, or alternatively, in their environments, so that the optimum
differs. The degree of turbulence in task environments is captured by
intertemporal variability in $\hat{z}_i(t)$.

The individuals are uninformed about $\hat{z}_i(t)$ ex ante, but engage in search
to get as close to it as possible. Given $N$ tasks with $d$ bits in each task
and the goal vector $\hat{z}_i(t)$, the period-$t$ performance of individual $i$ is then
measured by $\pi_i(t)$, where

$$\pi_i(t) = N \cdot d - D(z_i(t), \hat{z}_i(t)) \tag{2}$$


The performance of a social system is measured by how close the individuals
are to their respective goals. We let $\bar{\pi}(t)$ denote the population-average
performance in period $t$:

$$\bar{\pi}(t) = \frac{1}{M} \sum_{i=1}^{M} \pi_i(t) \tag{3}$$
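Equations (2) and (3) can be sketched in a few lines, assuming each methods vector and goal vector is a flat list of $N \cdot d$ bits. The helper names and the two-agent population below are hypothetical illustrations, not values from the article.

```python
from typing import List, Sequence


def hamming_distance(a: Sequence[int], b: Sequence[int]) -> int:
    """Number of positions at which the two bit vectors differ."""
    return sum(x != y for x, y in zip(a, b))


def performance(z: Sequence[int], goal: Sequence[int]) -> int:
    """Equation (2): N * d minus the Hamming distance to the goal vector."""
    return len(z) - hamming_distance(z, goal)


def average_performance(methods: List[List[int]], goals: List[List[int]]) -> float:
    """Equation (3): mean individual performance across the M agents."""
    scores = [performance(z, g) for z, g in zip(methods, goals)]
    return sum(scores) / len(scores)


# Hypothetical population of M = 2 agents with N * d = 4 bits each.
methods = [[1, 1, 0, 1], [0, 0, 0, 0]]
goals = [[1, 1, 0, 0], [1, 1, 1, 1]]
scores = [performance(z, g) for z, g in zip(methods, goals)]
mean_score = average_performance(methods, goals)
```

Performance is largest ($N \cdot d$) when an agent's methods vector coincides with her goal vector and falls by one for each mismatched bit.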


Modeling Innovation and Imitation 


In any given period, an individual’s search for an optimum is carried out
through two distinct mechanisms, innovation and imitation. Innovation
occurs when an individual independently discovers and considers for
implementation a random method for a randomly chosen task. Imitation is
when an individual selects someone and then observes and considers for
implementation some randomly chosen task currently deployed by that
agent.⁷ Though a single act of innovation or imitation is assumed to be
a single task, this is without loss of generality. If we choose to define a
task to include $d'$ dimensions, then the case of a single act of innovation
or imitation involving two tasks can be handled by setting $d = 2d'$.⁸ In
essence, what we are calling a “task” is defined to be the unit of discovery
or observation. The actual substantive condition is instead the relationship
between $d$ and $N$, as an agent’s innovation or imitation involves a smaller
part of the possible solution when $d/N$ is smaller.

Whether obtained through innovation or imitation, an experimental
method is actually adopted if and only if its adoption gets the agent closer
to her goal by lowering the Hamming distance between her new methods


7 It may be useful to think of innovation and imitation as being analogous to mutation
and crossover, respectively, in evolutionary biology.


8 There is a restriction in that an agent only has the option of adopting all $d$ dimensions
or none.
vector and her goal vector. For clarity, consider the following example
with $N = 5$ and $d = 2$:


agent $i$’s goal vector:               01|10|10|01|01
agent $i$’s current methods vector:    01|01|11|00|11


The relevant operation has five tasks. In each task, there are four distinct
methods that can be tried: (0,0), (0,1), (1,0), and (1,1). The Hamming
distance between $i$’s current methods vector and her goal vector is then
five. Suppose $i$ observes the method used for task 4 by another agent $j$
($\neq i$) whose methods vector is


agent $j$’s current methods vector:    10|10|11|01|01


Since $j$’s method in task 4 is (0,1), when it is tried by agent $i$, her experimental
methods vector becomes


agent $i$’s experimental methods vector:    01|01|11|01|11


which then reduces the Hamming distance to $i$’s goal to four; hence, the
experimental methods vector becomes $i$’s new methods vector.
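The example can be checked mechanically. The sketch below assumes the vectors read as 01|10|10|01|01 (goal), 01|01|11|00|11 (agent $i$), and 10|10|11|01|01 (agent $j$), which is the reading consistent with the stated distances of five and four. The helper names `parse` and `try_method` are hypothetical.

```python
from typing import List


def parse(vector: str) -> List[int]:
    """Turn a string such as '01|10' into a flat list of bits."""
    return [int(bit) for bit in vector.replace("|", "")]


def hamming_distance(a: List[int], b: List[int]) -> int:
    return sum(x != y for x, y in zip(a, b))


def try_method(current: List[int], goal: List[int], task: int,
               method: List[int], d: int) -> List[int]:
    """Adopt `method` for `task` only if it strictly lowers the distance to the goal."""
    start = (task - 1) * d  # tasks are numbered from 1 in the text
    trial = list(current)
    trial[start:start + d] = method
    if hamming_distance(trial, goal) < hamming_distance(current, goal):
        return trial
    return list(current)


D = 2
goal = parse("01|10|10|01|01")
current = parse("01|01|11|00|11")
other = parse("10|10|11|01|01")
observed = other[3 * D:4 * D]  # agent j's method in task 4
updated = try_method(current, goal, task=4, method=observed, d=D)
```

Under these assumed vectors the distance falls from five to four, so the experimental vector is adopted, as the text states.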


Endogenizing Choices for Innovation and Imitation 


In each period, an individual may engage in either innovation or imitation
by using the network. Exactly how does an individual choose between
innovation and imitation, and if she chooses to imitate, how does she
decide whom to imitate? We model this question as a two-stage stochastic
decision process with reinforcement learning. Figure 1 describes the timing
of decisions in our model. In stage 1 of period $t$, individual $i$ is in possession
of the current methods vector, $z_i(t)$, and chooses to innovate with probability
$q_i(t)$ and imitate with probability $1 - q_i(t)$. If she chooses to innovate
then, with probability $\mu^{in}$, she comes up with an idea which is a
randomly chosen task, $h \in \{1, \ldots, N\}$, and a randomly chosen method,
$z'^{h}$, for that task such that the experimental method vector is
$z_i'(t) = (z_i^1(t), \ldots, z_i^{h-1}(t), z'^{h}, z_i^{h+1}(t), \ldots, z_i^N(t))$.
$\mu^{in}$ is a parameter that controls the
productivity of an agent’s innovation. This experimental vector is adopted
by $i$ if and only if its adoption lowers the Hamming distance to her current
goal vector, $\hat{z}_i(t)$. Otherwise, it is discarded:

$$z_i(t+1) = \begin{cases} z_i'(t) & \text{if } D(z_i'(t), \hat{z}_i(t)) < D(z_i(t), \hat{z}_i(t)) \\ z_i(t) & \text{if } D(z_i'(t), \hat{z}_i(t)) \geq D(z_i(t), \hat{z}_i(t)) \end{cases} \tag{4}$$

Alternatively, with probability $1 - \mu^{in}$, the individual fails to generate an
idea, in which case $z_i(t+1) = z_i(t)$.
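A single innovation attempt, as described by equation (4), might be sketched as follows. This is an illustration under stated assumptions: the task and the method are drawn uniformly at random, vectors are flat bit lists, and the names `innovate`, `mu_in`, and `rng` are hypothetical.

```python
import random
from typing import List


def hamming_distance(a: List[int], b: List[int]) -> int:
    return sum(x != y for x, y in zip(a, b))


def innovate(z: List[int], goal: List[int], n_tasks: int, d: int,
             mu_in: float, rng: random.Random) -> List[int]:
    """One innovation attempt; returns the next-period methods vector."""
    if rng.random() >= mu_in:
        return list(z)  # no idea is generated, so z_i(t+1) = z_i(t)
    h = rng.randrange(n_tasks)                       # randomly chosen task
    method = [rng.randint(0, 1) for _ in range(d)]   # randomly chosen method
    trial = list(z)
    trial[h * d:(h + 1) * d] = method
    # Equation (4): adopt only if strictly closer to the goal vector.
    if hamming_distance(trial, goal) < hamming_distance(z, goal):
        return trial
    return list(z)


rng = random.Random(0)
goal = [0, 1, 1, 0, 1, 0, 0, 1, 0, 1]
z = [0, 1, 0, 1, 1, 1, 0, 0, 1, 1]
history = [hamming_distance(z, goal)]
for _ in range(200):
    z = innovate(z, goal, n_tasks=5, d=2, mu_in=0.5, rng=rng)
    history.append(hamming_distance(z, goal))
```

Because an idea is adopted only when it strictly improves on the current vector, the recorded distance to a fixed goal can never increase from one period to the next.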


Now suppose individual $i$ chooses to imitate in stage 1. Given that she
decides to imitate someone else, she taps into the network to make an
observation. Tapping into the network is also a probabilistic event, in
which with probability $\mu^{im}$, the agent is connected to the network, while
with probability $1 - \mu^{im}$, the agent fails to do so. $\mu^{im}$ measures the reliability
of the communication technology and thereby the network. An
agent that is connected then enters stage 2 of the decision process in which
another agent must be selected to be studied and possibly imitated. Let
$p_i^j(t)$ be the probability with which $i$ observes $j$ in period $t$, so
$\sum_{j \neq i} p_i^j(t) = 1$ for all $i$. If agent $i$ observes another agent $l$, the observation
involves a randomly chosen task $h$ and the current method used by agent
$l$ in that task, $z_l^h(t)$. Let
$z_i''(t) = (z_i^1(t), \ldots, z_i^{h-1}(t), z_l^h(t), z_i^{h+1}(t), \ldots, z_i^N(t))$
be the experimental vector. Adoption or rejection of the observed
method is based on the Hamming distance criterion:

$$z_i(t+1) = \begin{cases} z_i''(t) & \text{if } D(z_i''(t), \hat{z}_i(t)) < D(z_i(t), \hat{z}_i(t)) \\ z_i(t) & \text{if } D(z_i''(t), \hat{z}_i(t)) \geq D(z_i(t), \hat{z}_i(t)) \end{cases} \tag{5}$$

If the agent fails to connect to the network, which occurs with probability
$1 - \mu^{im}$, $z_i(t+1) = z_i(t)$.
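The imitation branch can be sketched in the same style. The code assumes agents are indexed from 0, that the observation probabilities for agent $i$ are held in a dictionary keyed by the other agents' indices, and that the task is drawn uniformly; `imitate`, `mu_im`, and `p_i` are hypothetical names.

```python
import random
from typing import Dict, List, Optional, Tuple


def hamming_distance(a: List[int], b: List[int]) -> int:
    return sum(x != y for x, y in zip(a, b))


def imitate(i: int, methods: List[List[int]], goal: List[int],
            p_i: Dict[int, float], n_tasks: int, d: int, mu_im: float,
            rng: random.Random) -> Tuple[List[int], Optional[int]]:
    """One imitation attempt by agent i.

    Returns the next-period methods vector and the index of the agent who was
    successfully imitated (None if nothing was adopted).
    """
    z = methods[i]
    if rng.random() >= mu_im:
        return list(z), None  # failed to connect to the network
    others = [j for j in range(len(methods)) if j != i]
    l = rng.choices(others, weights=[p_i[j] for j in others], k=1)[0]
    h = rng.randrange(n_tasks)  # randomly chosen task to observe
    trial = list(z)
    trial[h * d:(h + 1) * d] = methods[l][h * d:(h + 1) * d]
    # Equation (5): adopt only if strictly closer to agent i's own goal.
    if hamming_distance(trial, goal) < hamming_distance(z, goal):
        return trial, l
    return list(z), None


rng = random.Random(1)
goal_0 = [0, 1, 1, 0]
population = [[1, 1, 0, 0], [0, 1, 1, 0], [1, 0, 0, 1]]
new_z, partner = imitate(0, population, goal_0, {1: 0.5, 2: 0.5},
                         n_tasks=2, d=2, mu_im=1.0, rng=rng)
```

Returning the index of the imitated agent is a design convenience for this sketch: the reinforcement rule for stage 2 needs to know whose method was adopted.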

The probabilities $q_i(t)$ and $(p_i^1(t), \ldots, p_i^{i-1}(t), p_i^{i+1}(t), \ldots, p_i^M(t))$ are
adjusted over time by individual agents according to a reinforcement
learning rule. We adopt a version of the experience-weighted attraction
(EWA) learning rule as described in Camerer and Ho (1999).⁹ Using this
rule, $q_i(t)$ is adjusted each period on the basis of evolving attraction measures,
$B_i^{in}(t)$ and $B_i^{im}(t)$, for innovation and imitation, respectively. The
evolution of $B_i^{in}(t)$ and $B_i^{im}(t)$ follows the process below:

$$B_i^{in}(t+1) = \begin{cases} \phi B_i^{in}(t) + 1 & \text{if } i \text{ adopted a method through innovation in } t \\ \phi B_i^{in}(t) & \text{otherwise,} \end{cases} \tag{6}$$

$$B_i^{im}(t+1) = \begin{cases} \phi B_i^{im}(t) + 1 & \text{if } i \text{ adopted a method through imitation in } t \\ \phi B_i^{im}(t) & \text{otherwise,} \end{cases} \tag{7}$$



where $\phi \in (0, 1]$. Hence, if the agent chose to pursue innovation and
discovered and then adopted her new idea, then the attraction measure
for innovation increases by one after allowing for the decay factor of $\phi$
on the previous attraction level. If innovation was pursued but was unsuccessful
(which occurs if either she failed to generate an idea or succeeded


9 For a discussion of reinforcement learning mechanisms in general, see Sutton and
Barto (2000).


in generating an idea which turned out to be useless), or the agent
chose to pursue imitation instead, then her attraction measure for innovation
is simply the attraction level from the previous period decayed by
the factor $\phi$. Likewise, a success or failure in imitation in $t$ has the same
influence on $B_i^{im}(t+1)$. Given $B_i^{in}(t)$ and $B_i^{im}(t)$, one then derives the choice
probability of innovation in period $t$ as follows:


$$q_i(t) = \frac{(B_i^{in}(t))^{\lambda}}{(B_i^{in}(t))^{\lambda} + (B_i^{im}(t))^{\lambda}} \tag{8}$$

where $\lambda > 0$. A higher value for $\lambda$ means that a single success has more
of an impact on the likelihood of repeating that activity (innovation or
imitation).¹⁰ The probability of imitation is, of course, $1 - q_i(t)$. The expression
in equation (8) tells us that a favorable experience through innovation
(imitation) raises the probability that an agent will choose to
innovate (imitate) again in the future: a positive outcome realized from
a course of action reinforces the likelihood of that same action’s being
chosen again.
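The stage-1 reinforcement rule in equations (6) through (8) can be sketched as below. The initial attraction of 1.0 for both activities is an assumption made only for this illustration (the passage does not state initial values), and the function names are hypothetical.

```python
def update_attraction(attraction: float, success: bool, phi: float) -> float:
    """Equations (6) and (7): decay by phi, then add one after a success."""
    return phi * attraction + (1.0 if success else 0.0)


def innovation_probability(b_in: float, b_im: float, lam: float) -> float:
    """Equation (8): probability of choosing innovation in stage 1."""
    return b_in ** lam / (b_in ** lam + b_im ** lam)


# Assumed starting attractions (not taken from the article).
b_in, b_im = 1.0, 1.0
q_before = innovation_probability(b_in, b_im, lam=1.0)

# The agent innovates successfully in period t; imitation records no success.
b_in = update_attraction(b_in, success=True, phi=1.0)
b_im = update_attraction(b_im, success=False, phi=1.0)
q_after = innovation_probability(b_in, b_im, lam=1.0)
q_after_sharp = innovation_probability(b_in, b_im, lam=2.0)
```

With these assumed values a single success moves the innovation probability from 1/2 to 2/3 when $\lambda = 1$ and to 4/5 when $\lambda = 2$, which illustrates the remark that a larger $\lambda$ gives one success more impact.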

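The stage-2 attractions and probabilities are derived similarly. Let
$A_i^j(t)$ be agent $i$’s attraction to another agent $j$ in period $t$. It evolves
according to the rule below:

$$A_i^j(t+1) = \begin{cases} \phi A_i^j(t) + 1 & \text{if } i \text{ successfully imitated } j \text{ in } t \\ \phi A_i^j(t) & \text{otherwise,} \end{cases} \tag{9}$$

$\forall j \neq i$. The probability that agent $i$ observes agent $j$ in period $t$ is adjusted
each period on the basis of the attraction measures, $\{A_i^j(t)\}_{j \neq i}$:

$$p_i^j(t) = \frac{(A_i^j(t))^{\lambda}}{\sum_{h \neq i} (A_i^h(t))^{\lambda}} \tag{10}$$

$\forall j \neq i$, $\forall i$, where $\lambda > 0$.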
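Equations (9) and (10) admit a similar sketch. The attractions of one agent are assumed to be stored in a dictionary keyed by the indices of the other agents, and the starting values of 1.0 are an assumption for illustration only; the function names are hypothetical.

```python
from typing import Dict, Optional


def update_partner_attractions(a_i: Dict[int, float], imitated: Optional[int],
                               phi: float) -> Dict[int, float]:
    """Equation (9): decay every attraction, add one for the agent imitated."""
    return {j: phi * a + (1.0 if j == imitated else 0.0) for j, a in a_i.items()}


def observation_probabilities(a_i: Dict[int, float], lam: float) -> Dict[int, float]:
    """Equation (10): normalize the powered attractions over all j != i."""
    total = sum(a ** lam for a in a_i.values())
    return {j: a ** lam / total for j, a in a_i.items()}


# Agent 0's assumed attractions toward agents 1, 2, and 3.
a_0 = {1: 1.0, 2: 1.0, 3: 1.0}
a_0 = update_partner_attractions(a_0, imitated=2, phi=1.0)
p_0 = observation_probabilities(a_0, lam=1.0)
```

After one successful imitation of agent 2, that agent is observed with probability 1/2 and each of the other two with probability 1/4, so links to useful sources strengthen with experience.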

There are two distinct sets of probabilities in our model. One set of
probabilities, $q_i(t)$ and $\{p_i^j(t)\}_{j \neq i}$, are endogenously derived and evolve over
time in response to the personal experiences of agent $i$. Another set of
probabilities, $\mu^{in}$ and $\mu^{im}$, are exogenously specified and imposed on the
model as parameters. They represent the state of existing technologies
and control the potential capabilities of individual agents to innovate


independently or to imitate someone else in the population via social


10 For analytical simplicity, we assume $\phi$ and $\lambda$ to be common to all individuals in the
population.
learning.¹¹ Of particular interest is understanding how these parameters
influence the structure and performance of the network and the rate of
innovation, which we will measure by the population-average level of
innovation:

$$\bar{q}(t) = \frac{1}{M} \sum_{i=1}^{M} q_i(t) \tag{11}$$


Modeling Turbulence in Task Environment


Central to the performance of a population is how it responds to an
evolving environment, or, if we cast this in the context of problem solving,
an evolving set of problems to be solved. It is such change that makes
innovation and the spread of those innovations through a social network
so essential. Change or turbulence is specified in our model by first assigning
an initial goal vector, $\hat{z}_i(0)$, to each agent and then specifying a
dynamic process by which it shifts over time. In order to provide the
possibility of a substantive network’s forming, it is critical to allow for
some persistent similarity in goals across subsets of agents. This is
achieved by initially partitioning the population into a fixed number of
groups; those agents belonging to the same group tend to have more
similar goals (they are working on similar problems) than those belonging
to different groups. As such, for any given agent, there are two
broad sources for social learning: another agent in the same group and
another agent in a different group. We expect the efficacy of social learning
to depend on the tightness of the goals within a given group relative to
the tightness of the goals between different groups. To explore this issue,
we distribute the initial goal vectors of agents in a sequence of steps
described below.

Letting $s \in \{0, 1\}^{Nd}$, define $\delta(s, \kappa) \subset \{0, 1\}^{Nd}$ as the set of points that are
exactly Hamming distance $\kappa$ away from $s$. The set of points within
Hamming distance $\kappa$ of $s$ is defined as

$$
\Delta(s, \kappa) \equiv \bigcup_{x=0}^{\kappa} \delta(s, x). \tag{12}
$$

$\Delta(s, \kappa)$ is a set whose "center" is $s$.

11 One may view these as being completely determined by available technologies outside
of our model. For instance, the size of the manuscript libraries may have limited the
magnitude of $\mu_i^{im}$ in ancient Alexandria, while the invention of the printing press
presumably raised it in the 15th century: "Limits set by the very largest manuscript
libraries were also broken. Even the exceptional resources which were available to
ancient Alexandrians stopped short of those that were opened up after the shift from
script to print. The new open-ended information-flow that commenced in the fifteenth
century made it possible for fresh finding to accumulate at an ever-accelerated pace"
(Eisenstein 1979, pp. 517-18).

Suppose there are $J$ groups in the population and let us randomly
allocate the $M$ agents into these groups. Let $a_k$ be the set of agents
belonging to group $k \in \{1, 2, \ldots, J\}$. We define $g_k$ as the seed vector used
to generate the initial goal vectors for all agents in $a_k$:

$$
\hat{z}_i(0) \in \Delta(g_k, \kappa) \quad \forall i \in a_k, \ \forall k \in \{1, 2, \ldots, J\}. \tag{13}
$$

All agents in $a_k$ then have goal vectors that lie within Hamming distance
$\kappa$ of the group seed vector $g_k$. The diversity among groups is modeled by
allowing their group seed vectors to differ. Specifically, we define a seed
vector $U$ for the entire population and randomly select the group seed
vectors from $\Delta(U, X)$; the intergroup tightness of the goals is controlled
through $X$, the maximum Hamming distance between a group seed vector
and the population seed vector. Of course, the intragroup tightness of the
goals is controlled with $\kappa$ as described in (13). Figure 2 depicts how these
sets are related to one another. With $J = 4$, the four group seed vectors
are chosen from the set $\Delta(U, X)$. Taking the seed vector for group 2, $g_2$,
$\Delta(g_2, \kappa)$ is the set of vectors that are within Hamming distance $\kappa$ of $g_2$.
The initial goal vector for individual $i$, $\hat{z}_i(0)$, is then an element of this
set.
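
The two-level seeding procedure can be sketched in Python as follows. This is only an illustration under stated assumptions: the helper names are hypothetical, the draw picks a number of bits to flip uniformly between 0 and the distance bound (the text does not say how a point of the set is selected, and this choice is not uniform over the set), and agents are split into equal-sized groups.

```python
import random


def hamming(a, b):
    """Number of positions at which two equal-length bit vectors differ."""
    return sum(x != y for x, y in zip(a, b))


def draw_within(seed, max_dist, rng):
    """Return a vector within Hamming distance max_dist of seed.
    Assumption: the number of flipped bits is uniform on 0..max_dist."""
    n_flips = rng.randint(0, min(max_dist, len(seed)))
    flipped = set(rng.sample(range(len(seed)), n_flips))
    return tuple(1 - b if k in flipped else b for k, b in enumerate(seed))


rng = random.Random(0)
n_bits, X, kappa, n_groups, n_agents = 96, 16, 16, 4, 20

# Population seed vector U, then one seed per group drawn near U.
population_seed = tuple(rng.randint(0, 1) for _ in range(n_bits))
group_seeds = [draw_within(population_seed, X, rng) for _ in range(n_groups)]

# Random allocation of agents to groups (equal sizes assumed here).
agents = list(range(n_agents))
rng.shuffle(agents)
group_of = {agent: idx % n_groups for idx, agent in enumerate(agents)}

# Initial goal vector of each agent, drawn near the agent's group seed.
initial_goal = {i: draw_within(group_seeds[k], kappa, rng)
                for i, k in group_of.items()}
```

By construction every group seed is within distance $X$ of the population seed and every initial goal is within distance $\kappa$ of its group seed, which is the content of (13).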

In period $t$, agent $i$ has the current goal vector of $\hat{z}_i(t)$. In period $t + 1$,
her goal stays the same with probability $\sigma$ and changes with probability
$(1 - \sigma)$. The shift dynamic of the goal vector is guided by the following
stochastic process. The goal in $t + 1$, if different from $\hat{z}_i(t)$, is then chosen
iid from the set of points that lie both within Hamming distance $\rho$
of $\hat{z}_i(t)$ and within Hamming distance $\kappa$ of the original group seed vector
$g_k$. Hence, defining $\Lambda(\hat{z}_i(t), \rho, g_k, \kappa)$ as the set of points from which the
goal in $t + 1$ is chosen, we have

$$
\Lambda(\hat{z}_i(t), \rho, g_k, \kappa) \equiv \left[\Delta(\hat{z}_i(t), \rho) \setminus \{\hat{z}_i(t)\}\right] \cap \Delta(g_k, \kappa). \tag{14}
$$

Figure 2 shows $\Lambda(\hat{z}_i(t), \rho, g_k, \kappa)$ as the doubly shaded area, which is the
intersection of $\Delta(\hat{z}_i(t), \rho)$ and $\Delta(g_k, \kappa)$, minus $\hat{z}_i(t)$. Consequently,

$$
\hat{z}_i(t+1)
\begin{cases}
= \hat{z}_i(t) & \text{with probability } \sigma, \\
\in \Lambda(\hat{z}_i(t), \rho, g_k, \kappa) & \text{with probability } 1 - \sigma.
\end{cases}
\tag{15}
$$

The goal vector for agent $i$, who belongs to group $k$, then stochastically
fluctuates while remaining within Hamming distance $\rho$ of her current goal
and Hamming distance $\kappa$ of the group's initial seed vector. The former
condition allows us to control the possible size of the change, while the
latter condition allows us to maintain the intragroup tightness of goals.$^{12}$


12 If we had instead allowed an agent's goal to be a random walk without restricting
it to $\Delta(g_k, \kappa)$, then eventually the distance between one agent's goal and another agent's
goal would be independent of whether they are in the same group, so that intragroup
tightness would have been lost.




The lower $\sigma$ and the greater $\rho$, the more frequent and variable is the
change, respectively, in an agent's goal vector. The higher $\kappa$ is, the lower
the intragroup goal congruence. The higher $X$ is, the greater the intergroup
diversity in terms of their goals.
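
The goal dynamic in (14) and (15) can be sketched as below. The sketch uses rejection sampling, which is an assumption of this illustration: it proposes between 1 and $\rho$ bit flips and keeps the first proposal that stays within distance $\kappa$ of the group seed, so the draw is not uniform over the admissible set. The function names and the fallback after `max_tries` failed proposals are likewise hypothetical.

```python
import random


def hamming(a, b):
    """Number of positions at which two equal-length bit vectors differ."""
    return sum(x != y for x, y in zip(a, b))


def shift_goal(goal, group_seed, sigma, rho, kappa, rng, max_tries=10_000):
    """One period of the goal dynamic: keep the goal with probability sigma,
    otherwise move to a different point within distance rho of the current
    goal and within distance kappa of the group seed."""
    if rng.random() < sigma:
        return goal
    for _ in range(max_tries):
        n_flips = rng.randint(1, rho)  # at least one flip, so the goal changes
        flipped = set(rng.sample(range(len(goal)), n_flips))
        candidate = tuple(1 - b if k in flipped else b
                          for k, b in enumerate(goal))
        if hamming(candidate, group_seed) <= kappa:
            return candidate
    return goal  # hypothetical fallback if no admissible point was proposed


rng = random.Random(1)
group_seed = tuple(rng.randint(0, 1) for _ in range(96))
path = [group_seed]
for _ in range(500):
    path.append(shift_goal(path[-1], group_seed,
                           sigma=0.75, rho=4, kappa=16, rng=rng))
```

Each step moves at most $\rho$ bits, and the goal never leaves the ball of radius $\kappa$ around the group seed, which is what preserves intragroup tightness over time.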


Measuring the Network Structure


A social network emerges when individuals rely on observation and
imitation of others. A main structural characteristic of a social network is
its concentration. Does an individual learn from many or from a relatively
narrow set of other individuals? In our context, this question can be
addressed by observing the distribution of the $p_i^j(t)$'s. If an individual is
equally likely to imitate any other member of the population, so that
$p_i^j(t) = 1/(M - 1)$ for all $j \neq i$, there is no order in the social network, as
imitation is completely (that is, uniformly) random. Alternatively, if the probability
of observing another agent is concentrated on a single individual, so that
$p_i^j(t) = 1$ for some $j$, then there is a maximal degree of order in the
network.

An appropriate measure for this purpose is Shannon's (1948) "entropy,"
which was originally defined in the context of information theory as an
inverse measure of the information content of a message. In adapting this
measure for the context at hand, the entropy measure for the social
network of agent $i$ is defined to be

$$
E_i(t) = -\sum_{j \neq i} p_i^j(t) \log_2 p_i^j(t). \tag{16}
$$

The value for $E_i(t)$ can range from a minimum of zero to a maximum of
$\log_2(M - 1)$.$^{13}$ By taking an average of $E_i(t)$ over all individuals in the
population, we obtain the mean entropy of the network system:

$$
\bar{E}(t) = \frac{1}{M} \sum_{i=1}^{M} E_i(t). \tag{17}
$$

We say that the population network structure becomes more ordered
(random) as $\bar{E}(t)$ decreases (increases).
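
As a small worked check of (16) and (17), the Python sketch below computes the entropy of one agent's observation probabilities and the population mean. The function names are assumptions of this illustration, and zero-probability terms are skipped under the usual convention that $0 \log 0 = 0$.

```python
import math


def agent_entropy(probs):
    """Entropy (base 2) of one agent's observation probabilities, as in (16)."""
    return -sum(p * math.log2(p) for p in probs if p > 0)


def mean_entropy(prob_rows):
    """Population mean of the agent entropies, as in (17)."""
    return sum(agent_entropy(row) for row in prob_rows) / len(prob_rows)


M = 20
uniform = [1 / (M - 1)] * (M - 1)          # every other agent equally likely
concentrated = [1.0] + [0.0] * (M - 2)     # all mass on a single agent

print(round(agent_entropy(uniform), 2))    # 4.25
```

The uniform case reproduces the maximum of $\log_2(M - 1) \approx 4.25$ for $M = 20$, and the concentrated case gives zero entropy.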


13 $E_i(t) = 0$ when the probability mass is entirely concentrated on one particular
individual $k \neq i$, so that $p_i^k(t) = 1$ and $p_i^j(t) = 0$ $\forall j \neq k, i$. Conversely, maximal entropy
is attained at $E_i(t) = \log_2(M - 1)$ if observation is equally likely for all individuals,
$p_i^j(t) = 1/(M - 1)$ $\forall j \neq i$.

SIMULATION DESIGN


We consider a population of 20 individuals with identical capacities for
learning:$^{14}$ $\mu_i^{in} = \mu^{in}$ and $\mu_i^{im} = \mu^{im}$ $\forall i$. There are four ($J = 4$) distinct
groups into which these agents are allocated. Assuming $N = 24$ and
$d = 4$, there are 96 total bits in a methods vector.$^{15}$ The initial methods
vectors, the $z_i(0)$'s, are independent draws from $\{0, 1\}^{Nd}$. In any time period
$t$, the state variables for individual $i$ are then $z_i(t)$, $\hat{z}_i(t)$, $\{A_i^j(t)\}_{j \neq i}$, $B_i^{in}(t)$,
and $B_i^{im}(t)$. The parameters of the model are $\mu^{in}$, $\mu^{im}$, $\phi$, and $\lambda$, which
govern an individual agent's decision making, and $X$, $\kappa$, $\sigma$, and $\rho$, which
control the environment by specifying intragroup and intergroup goal
congruence as well as the dynamics of the task environments. So that
results are not driven by the peculiarities of the initial methods vectors,
we will focus on the steady-state behavior of the social system with the
particular intent of understanding how it depends on these agent and
environment parameters. The only exceptions are $\phi$ and $\lambda$, which are set to
$\phi = 1$ and $\lambda = 1$ and remain fixed. Finally, the initial attraction stocks are set at
$A_i^j(0) = 1$ $\forall i$, $\forall j \neq i$, and $B_i^{in}(0) = B_i^{im}(0) = 1$ $\forall i$. Hence, an individual
is initially equally attracted to innovation and imitation and has no
inclination to observe one individual over another ex ante. Table 1 provides
a comprehensive list of parameters along with the set of values considered
over the course of the simulations.

In each case, the model is run for 20,000 periods. For each period,
$q_i(t)$ and $\{p_i^j(t)\}_{j \neq i}$ are collected, as well as the performance of each
individual, $\pi_i(t)$. Population averages were then computed for these time
series, thereby giving us $\bar{E}(t)$, $\bar{q}(t)$, and $\bar{\pi}(t)$ for each replication. For each
parameter configuration, we performed 20 replications, each of 20,000
periods in length, where we had a fresh realization of the random variables,
including the initial methods vectors, the agents' choices (recall that they
are probabilistic), the outcome of innovation and imitation, the seed
vectors, and agents' optima. The time series for the relevant variables, such
as the entropy and performance measures, were then averaged over the
20 replications so as to generate the final time series reported here.
Hereinafter, $\bar{E}(t)$, $\bar{\pi}(t)$, and $\bar{q}(t)$ will denote the final time series averaged over
these 20 replications.


STRUCTURE OF NETWORKS 


For the baseline simulation, we considered $\mu^{in} = .5$ and $\mu^{im} = .5$ so that
when in the innovation mode, an agent generates an idea 50% of the
time, and when in the imitation mode, an agent observes another agent
50% of the time. The stability of the task environment is set at $\sigma = .75$,
so that the local optimum for an individual shifts 25% of the time, and
$\rho = 4$ so that in case of such a shift, four or fewer randomly chosen bits
in the goal vector will flip. We also assume moderate intragroup and
intergroup diversity by setting $\kappa = 16$ and $X = 16$, respectively.


14 The source code for the simulation design, written in C++, is available upon request
from Myong-Hun Chang.

15 The search space then contains over $7.9 \times 10^{28}$ possibilities.


TABLE 1
LIST OF PARAMETERS

Parameter       Definition                                   Baseline Value   Parameter Values Considered
$\mu^{in}$      Exogenous rate of innovation                 .5               {0, .25, .5, .75, 1}
$\mu^{im}$      Exogenous rate of imitation                  .5               {0, .05, .1, .15, .2, ..., .9, .95, 1}
$X$             Intergroup goal diversity                    16               {0, 4, 8, 16, 32, 64, 96}
$\kappa$        Intragroup goal diversity                    16               {4, 8, 16, 32}
$\sigma$        Intertemporal goal stability                 .75              {.5, .75, .95, .99}
$\rho$          Intertemporal goal variability               4                {1, 4, 8}
$M$             Number of agents                             20
$J$             Number of groups                             4
$\phi$          Attraction decay factor                      1
$\lambda$       Agent's sensitivity to attraction            1
$A_i^j(0)$      $i$'s attraction to $j$ in $t = 0$           1
$B_i^{in}(0)$   $i$'s attraction to innovation in $t = 0$    1
$B_i^{im}(0)$   $i$'s attraction to imitation in $t = 0$     1

Figure 3 shows the results from the baseline simulation of the endogenous
network. These results are typical of those from many other parameter
configurations. The mean performance of the population, shown in figure
3(A), increases rapidly early on and then converges to a steady state.
Toward understanding how the social network evolves, figure 3(B) plots
the entropy measure. Recall that at $t = 0$, the attractions are all identically
equal to one, so consequently, $p_i^j(0) = 1/(M - 1)$ $\forall j \neq i$, $\forall i$. The social
network is then starting from a point of maximum entropy of 4.25. As is
shown in figure 3(B), it then monotonically declines to about 3.64 by the
end of the 20,000 periods; 3.64 is equivalent to the entropy of a fully
random network with only 13.5 agents.$^{16}$ The network is then becoming
increasingly ordered as each agent's imitation is being concentrated on
an ever smaller set of other agents.
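
The "13.5 agents" equivalence follows from inverting the maximum-entropy formula: a fully random network of $M$ agents has entropy $\log_2(M - 1)$, so an entropy of $E$ corresponds to $M = 2^E + 1$. A short Python check, with a hypothetical function name:

```python
import math


def equivalent_agents(entropy_bits):
    """Population size whose fully random network has the given entropy,
    obtained by solving entropy = log2(M - 1) for M."""
    return 2 ** entropy_bits + 1


print(round(equivalent_agents(3.64), 1))  # 13.5
```

Applied to the initial entropy $\log_2 19 \approx 4.25$, the same formula returns the full population of 20.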

Given that agents focus their attention on an increasingly narrower set
of other agents, to whom are they attracted? Is learning mutual so that
agent $i$, who chooses to learn from agent $j$, is also a primary source of
knowledge for agent $j$? How strong is the tendency for mutual learning
within groups? These questions can be answered only by going beyond
the measure of entropy and delving into the relationships between the
stage-2 choice probabilities of agents in the population. For this purpose,
we impose extra structure on the simulation by randomly allocating 20
individuals to four groups and then holding their group memberships
fixed across the 20 replications.$^{17}$ The output of this exercise is reported
in figure 4 and table 2. Letting $\bar{p}_i^j(t)$ denote the time series of $p_i^j(t)$ averaged
over the 20 replications, agent $i$'s steady-state choice probability of
observing agent $j$ is defined by

$$
\hat{p}_i^j = \frac{1}{5{,}000} \sum_{t=15{,}001}^{20{,}000} \bar{p}_i^j(t). \tag{18}
$$


16 As there are five agents in each agent's group, it is not then true that each of these
groups form their own networks. This does not occur for a variety of reasons. First,
it is possible that agents in other groups might be more similar, though that will not
hold on average. Second, even if agents outside of one's group are less similar than
those in one's group, randomness in what agents adopt and what is observed could
cause an agent to learn more from nongroup agents and thus induce the establishment
of stronger links with them.

17 This structure is imposed only on the simulations reported in this section. For all
other simulations, the individuals are reallocated (randomly) to the groups at the start
of each replication.


Fig. 3. Baseline time series for $\mu^{in} = .5$ and $\mu^{im} = .5$: (A) mean performance, $\bar{\pi}(t)$; (B) mean entropy, $\bar{E}(t)$; each plotted against $t$ from 0 to 20,000.

Figure 4 plots pairs of choice probabilities, $(\hat{p}_i^j, \hat{p}_j^i)$, as points in a
probability space for all $i$ and $j$, $i \neq j$. Since there are 20 agents in our
experiment and each agent can then observe 19 other agents, there are 190
distinct points in total. For each point, the horizontal coordinate is the
probability agent $i$ observes agent $j$ and the vertical coordinate is the
probability agent $j$ observes agent $i$. First, note that these probabilities
range widely from 0.02 to 0.15, though there is a concentration of these
points at lower probabilities, around 0.04. More important, the plot clearly
indicates that there is a positive relationship between $\hat{p}_i^j$ and $\hat{p}_j^i$ such that
a relatively high (low) $\hat{p}_i^j$ is paired with a relatively high (low) $\hat{p}_j^i$. In other
words, social learning tends to be mutual. For a more thorough verification
of this property, we computed correlations between $\hat{p}_i^j$ and $\hat{p}_j^i$; see table 2.

The correlation is positive in all cases and is generally rather high.
Furthermore, the correlation appears to increase in $X$ (see table 2[B] for when
$X$ is raised to 32) and decrease in $\kappa$ (see table 2[C] for when $\kappa$ is lowered
to 8). Mutuality in learning is stronger, the greater the intergroup
diversity in goals and the lower the intragroup diversity.

TABLE 2
CORRELATION BETWEEN $\hat{p}_i^j$ AND $\hat{p}_j^i$

                      $\mu^{im}$ = .25   .5         .75        1
A. $X = 16$, $\kappa = 16$
  $\mu^{in}$ = 0         .352657    .34733     .29233     .480061
  $\mu^{in}$ = .25       .807451    .773048    .664245    .506862
  $\mu^{in}$ = .5        .586809    .823839    .806357    .689279
  $\mu^{in}$ = .75       .406306    .791755    .799773    .784047
  $\mu^{in}$ = 1         .268945    .681329    .860758    .821619
B. $X = 32$, $\kappa = 16$
  $\mu^{in}$ = 0         .540246    .60035     .736072    .630133
  $\mu^{in}$ = .25       .901541    .862012    .835666    .795593
  $\mu^{in}$ = .5        .786882    .941283    .906933    .878898
  $\mu^{in}$ = .75       .546645    .892148    .93532     .911084
  $\mu^{in}$ = 1         .385492    .837522    .930428    .949636
C. $X = 16$, $\kappa = 8$
  $\mu^{in}$ = 0         .649251    .718233    .741049    .706987
  $\mu^{in}$ = .25       .875383    .851206    .83393     .845132
  $\mu^{in}$ = .5        .870919    .920375    .857695    .837231
  $\mu^{in}$ = .75       .735823    .922655    .907393    .90641
  $\mu^{in}$ = 1         .57091     .912438    .905253    .910782
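
A mutuality correlation of the kind reported in table 2 could be computed along the following lines. This is a sketch under assumptions: the function names are hypothetical, the matrices are toy inputs rather than simulation output, and the correlation is taken over all ordered pairs $(i, j)$, $i \neq j$, so that the result does not depend on how agents are labelled (the text does not state which pairing convention was used).

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / math.sqrt(vx * vy)


def mutuality(p_hat):
    """Correlation between p_hat[i][j] and p_hat[j][i] over ordered pairs."""
    n = len(p_hat)
    pairs = [(i, j) for i in range(n) for j in range(n) if i != j]
    xs = [p_hat[i][j] for i, j in pairs]
    ys = [p_hat[j][i] for i, j in pairs]
    return pearson(xs, ys)


# Toy example: agents 0-1 and 2-3 mostly observe each other (mutual learning).
mutual = [[0.0, 0.6, 0.2, 0.2],
          [0.6, 0.0, 0.2, 0.2],
          [0.2, 0.2, 0.0, 0.6],
          [0.2, 0.2, 0.6, 0.0]]

# Toy example: each agent mostly observes the next one (one-way learning).
one_way = [[0.0, 0.6, 0.2, 0.2],
           [0.2, 0.0, 0.6, 0.2],
           [0.2, 0.2, 0.0, 0.6],
           [0.6, 0.2, 0.2, 0.0]]

r_mutual = mutuality(mutual)
r_one_way = mutuality(one_way)
```

The mutual pattern yields a correlation close to one, while the one-way pattern yields a negative correlation, which is the contrast the table's positive entries are meant to rule out.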


To delve further into network structure, let us impose a bit more
structure without losing any generality. Suppose agents are initially allocated
to different groups in a nonrandom way so that group 1 contains agents
1 through 5, group 2 contains agents 6 through 10, group 3 contains agents
11 through 15, and group 4 contains agents 16 through 20. The steady-state
observation probabilities, $\hat{p}_i^j$, are described in figure 5, where agent
$i$'s (the observer's) identity is represented along the vertical frame and
agent $j$'s (the target's) identity is represented along the horizontal frame.
A scaled value of $\hat{p}_i^j$ is then captured by the grey shading of the cell located
on the $i$th row and $j$th column. The lighter (darker) the shading, the higher
(lower) the probability with which agent $i$ observes agent $j$. Since an agent
observes herself with zero probability, the diagonal cells are shaded
completely black. The most striking feature of the figure is the emergence of
four adjoining square (5 × 5) blocks of cells along the diagonal having
lighter shades. Each block measures the probabilities of various agents'
observing members of the same group. Clearly, not only is learning mutual,
it is also more active among agents sharing similar goals.$^{18}$ Considered
together with the results reported in table 2, it indicates that such
intragroup mutual learning is more intensive when the groups are more
segregated and isolated from one another.


Fig. 5. $\hat{p}_i^j$, the endogenous probability of $i$ imitating $j$, with nonrandom agent allocation ($X = 16$, $\kappa = 16$, $\mu^{in} = \mu^{im} = .5$, $\sigma = .75$, $\rho = 4$).


18 As noted earlier, Carley (1991) constructs a model of endogenous social groups in
which the probability of interaction between agent $i$ and agent $j$ is solely determined
by how much information they share. However, in her formulation, the agents are not
goal driven and the resulting mutual learning between members of the same group
arises from the specification of these interaction probabilities at the social level, rather
than from the deliberate choices made at the individual agent level.


COMPARATIVE DYNAMICS


Having explored the emergent structure of networks, the next step is to
investigate how performance is influenced by the reliability of the
communication technology supporting the network and the productivity of
agents in engaging in innovation. Furthermore, we must investigate how
this relationship is impacted by features of the environment such as the
turbulence in the task environments and interagent goal diversity. So as
to abstract away the impact of initial conditions as well as to focus on the
long-run properties of the system, our analysis will concentrate on the
key endogenous variables at the steady state. Figure 3(A) shows that the
social system reaches the steady state by $t = 10{,}000$. The speed of
convergence to the steady state varies, depending on the parameter values.
By taking our measurements as per-period averages over the time periods
between 15,000 and 20,000, we are confident that our observations
represent the system's behavior at the steady state.$^{19}$ To this end, define $\bar{E}$,
$\bar{\pi}$, and $\bar{q}$ as the steady-state values of the endogenous variables:

$$
\bar{E} = \frac{1}{5{,}000} \sum_{t=15{,}001}^{20{,}000} \bar{E}(t), \quad
\bar{\pi} = \frac{1}{5{,}000} \sum_{t=15{,}001}^{20{,}000} \bar{\pi}(t), \quad
\bar{q} = \frac{1}{5{,}000} \sum_{t=15{,}001}^{20{,}000} \bar{q}(t). \tag{19}
$$


19 Note that the performance measure is an average over the 5,000 periods from $t = 15{,}000$
to 20,000, with that being averaged over 20 replications. Two conditions need
to be satisfied in order for this to be an appropriate measure for comparative dynamics:
(1) the long-run output must not be sensitive to minor changes in initial conditions,
and (2) the output must converge to a steady state by $t = 15{,}000$ such that the
per-period average from 15,000 to 20,000 approximates the mean of the limiting distribution
of the underlying stochastic process. For this purpose, we examined the output from
a set of randomly selected replications using different initial conditions. For any given
set of parameter values, the output generated from these runs is quite similar, and we
found no evidence that the long-run population averages are sensitive to initial
conditions. This suggests that the stochastic process generating $\bar{\pi}(t)$ is ergodic; that is, the
transient distribution of the output in period $t$ converges to a stationary distribution
in the limit as $t \to \infty$ for any initial conditions. While the limit distribution is independent
of initial conditions, the rate at which the transient distribution converges to it is
generally not. A computationally efficient procedure would then call for identifying
the minimum time index, $t$, such that the transient distributions are approximately the
same from that point on. We have examined the time series of $\bar{\pi}(t)$ averaged over 20
replications for the baseline parameter values of $X = 16$, $\kappa = 16$, $\rho = 4$, and $\sigma = .75$
for all $\mu^{in} \in \{.25, .5, .75, 1\}$ and all $\mu^{im} \in \{0, .05, .1, \ldots, .9, .95, 1\}$. The simulations
looked very similar to the one captured in figure 3(A). In all cases, the convergence
occurred by $t = 15{,}000$. See Law and Kelton (2000) for further discussions on how to
identify the steady state.

In the remaining part of this article, the focus is on exploring the impact
of $\mu^{in}$, $\mu^{im}$, $\sigma$, $\rho$, $X$, and $\kappa$ on $\bar{E}$, $\bar{\pi}$, and $\bar{q}$.$^{20}$
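
The steady-state measures in (19) amount to two averaging steps: first across replications, period by period, and then over the last 5,000 periods. The Python sketch below illustrates this with synthetic series; the function names, the zero-based list layout (index $t - 1$ holds period $t$), and the exact window boundaries are assumptions of this illustration.

```python
def average_replications(replications):
    """Period-by-period mean across replications of equal length."""
    n = len(replications)
    return [sum(values) / n for values in zip(*replications)]


def steady_state_mean(series, first_period=15_001, last_period=20_000):
    """Per-period average over the closing window of a time series,
    where series[t - 1] holds the value for period t."""
    window = series[first_period - 1:last_period]
    return sum(window) / len(window)


# Two synthetic replications that settle after period 15,000.
rep_a = [0.0] * 15_000 + [2.0] * 5_000
rep_b = [1.0] * 15_000 + [4.0] * 5_000

averaged = average_replications([rep_a, rep_b])
steady = steady_state_mean(averaged)
print(steady)  # 3.0
```

Because only the final window enters the average, the transient values in the first 15,000 periods have no influence on the reported steady-state figure.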


Impact of the Reliability of the Network 


Figure 6 plots $\bar{E}$, $\bar{q}$, and $\bar{\pi}$ for various values of
$\mu^{im} \in \{0, .05, .1, .15, .2, .25, \ldots, .9, .95, 1\}$, given the baseline parameter
values, $\mu^{in} = .5$, $\sigma = .75$, $X = 16$, $\kappa = 16$, and $\rho = 4$. Figure 6(A) shows that when there is
minimum reliability, $\mu^{im} = 0$, the mean entropy at the steady state remains
at its initial value of 4.25, which corresponds to a fully random network.
The mean entropy gradually drops as $\mu^{im}$ increases and then is essentially
constant once $\mu^{im}$ achieves a sufficiently high level (.4 in this case). Further
improvements in reliability mildly raise entropy for reasons that are
unclear.

Figure 6(B) shows that the endogenous probability of choosing
innovation, $\bar{q}$, is near its maximum value of one when the reliability of the
communication technology is at its minimum, $\mu^{im} = 0$. As $\mu^{im}$ rises, the
rate of innovation correspondingly declines, as a more reliable network
encourages agents to engage in more imitation and to form a more
useful network (as reflected in falling entropy). However, the impact of
a marginal gain in $\mu^{im}$ on $\bar{q}$ varies, depending on the level of $\mu^{im}$. When
$\mu^{im}$ is low, $\bar{q}$ declines at an increasing rate up to $\mu^{im} = .35$. Beyond that
point, $\bar{q}$ declines at a decreasing rate.

Most interesting is figure 6(C), where the steady-state performance of
the social system, $\bar{\pi}$, is plotted as a function of the network reliability,
$\mu^{im}$. The plot has the S-shape of a logistic curve, where the performance
initially rises at an increasing rate until reaching an inflection point, at
which the marginal gain is maximal, and then ends up in a region of
diminishing returns. Performance achieves its maximum level at the point
of maximum reliability of the network.

The logistic curve shape of $\bar{\pi}$ with respect to $\mu^{im}$ is universal to all
parameter configurations considered. However, a notable property of
figure 6(C), that performance peaks at maximum reliability, is not universal.
Figure 7 plots steady-state performance as a
function of $\mu^{im}$ for the same parameters as in figure 6 except that $\mu^{in}$ is
lowered to .25 and $\kappa$ to 4; agents are less productive at innovation and
the degree of intragroup goal congruence is higher. Performance is now
declining in network reliability when reliability is sufficiently great
(specifically, from about $\mu^{im} = .3$). Quite surprisingly, an improvement in the
communication technology leads to a deterioration in performance. Further
simulations reveal that the nonmonotonicity of $\bar{\pi}$ in $\mu^{im}$ is a general property
for some part of the parameter space. In particular, the simulation results
reported in figures 8 and 9 show that $\bar{\pi}$ declines in $\mu^{im}$ when $\sigma$ is low, $\rho$
is high, and/or $X$ is high.


20 It is important to remember that the system never settles down, as agents' goals are
always stochastically changing and, as a result, agents are always changing their behavior
and their methods vectors. These reported variables ($\bar{E}$, $\bar{\pi}$, $\bar{q}$) are then to be interpreted
as the means of the stationary distribution for this stochastic process.

Property 1. When the reliability of the network ($\mu^{im}$) is sufficiently
low, steady-state performance ($\bar{\pi}$) is increasing in reliability. When the
reliability of the network is sufficiently high and the task environment is
sufficiently volatile ($\sigma$ low and/or $\rho$ high) and/or the goal diversity among
groups is sufficiently great ($X$ high), performance is decreasing in
reliability.

Why does the reliability of the network have a deleterious effect on
performance? Recall that there are two search strategies in our model,
innovation (individual learning) and imitation (social learning). For a
given value of $\mu^{in}$, an increase in $\mu^{im}$ raises the rate at which existing
useful practices diffuse in the population, which certainly improves the
short-run performance of an average agent. However, in the longer run,
its impact is more subtle because agent behavior regarding innovation,
imitation, and network links is adapting. In particular, a more reliable
network means that new ideas are diffused faster, imitation tends to be
substituted for innovation, and the greater use of imitation allows a more
structured network to form. All of these effects have the implication of
making agents in a network more alike as they observe and adopt the
practices of others. When the task environment is relatively stable, this
lack of diversity is not a problem, and it is more critical that the faster
social learning allowed by a higher value of $\mu^{im}$ (and the commensurate
fall in entropy) speeds up convergence to local optima. But when the task
environment is sufficiently volatile, agents have to modify their tasks
continually. For that reason, homogenization of the network is seriously
harmful, as it is less likely that anyone in the network will have useful
tasks that would serve the new environment well. While individually it
may make sense to utilize the network more intensively when it is more
reliable, the ensuing lack of diversity that occurs as imitation crowds
out innovation is detrimental from a collective perspective. This finding
explains why, under certain conditions, steady-state performance can
decline in response to a more reliable network. Better communication does
not necessarily mean higher performance when agents can adjust their
mix of innovation and imitation and their network links.

As for the role of intergroup diversity (as measured by $X$), first note
that when $X$ is low, the optima of agents from different groups tend to
overlap to a greater extent. This overlap leads to social learning that is
more global in nature in that an agent tends to observe other agents in
different groups more frequently than when optima are tightly clustered
around distinct groups. The implication of such global learning in our
context is that it allows interagent diversity to survive over time instead
of leading the social system to a collection of isolated homogeneous clusters
of individuals who are unable to adapt flexibly to changing environments
because of the lack of diversity.$^{21}$ This is portrayed in figures 9(A)-(B).

What we have seen in this section is the critical role that persistent
diversity plays in promoting steady-state performance in the context of a
changing environment. A simple improvement in the reliability of the
network may harm long-run performance because of how agents adapt
their rates of innovation and imitation. Enhanced communication
technology can induce too much imitation with a resulting loss of diversity
in responding to future environments. In that case, an individual agent's
capacity to carry on independent innovation is crucial in supplying the
necessary fuel for the effective operation of the social networks. To that
issue, we turn next.


21 Note that for a higher value of $X$, there is more intergroup diversity in the population.
However, this diversity is not particularly useful for the agents, as their goals are very
different and the likelihood of learning anything useful from the agents in other groups
is much lower.


Fig. 8. Impact of $\sigma$ and $\rho$ ($X = 16$, $\kappa = 16$)

Fig. 9. Impact of $X$ and $\kappa$ ($\sigma = .75$, $\rho = 4$)


Impact of the Productivity of Innovation 


Given the relationship between network reliability and performance, the
next task is to explore how this relationship depends on the capacity of
agents to innovate. Obviously, agents who are more productive at
innovation have an immediate virtue through enhancing individual learning.
But productivity at innovation is also beneficial from a social perspective,
as it implies a wider variation in methods in the population, which is the
raw material for, in the short run, responding to a changing environment.
This suggests that more innovative agents are also beneficial from a social
learning perspective. However, as the results below will show, it is also
necessary that the network be well developed for diffusing ideas, and that
depends not only on the exogenously determined reliability, but also on
the endogenously determined structure of the network. The extent to
which a higher capacity for innovation improves performance depends
on how well the network is developed, which depends on how extensively
agents use it, which depends on their innovation-imitation mix, which
depends on the capacity for innovation; thus, we come full circle. The
ensuing results are far from transparent at this point.

Figure 10(A) reports the frequency with which agents choose to innovate
on average, $\bar{q}$, as a function of network reliability, $\mu^{im}$, for various
innovation capabilities, $\mu^{in} \in \{0, .25, .5, .75, 1\}$. The impact of $\mu^{in}$ on the
frequency with which agents try to innovate is monotonic: $\bar{q}$ rises as $\mu^{in}$
is increased. Populations with more innovative members will indeed
engage in more innovation, irrespective of network reliability. The
steady-state entropy of the social network, $\bar{E}$, is plotted as a function of $\mu^{im}$ in
figure 10(B) for varying values of $\mu^{in}$. Note that as $\mu^{in}$ goes up, entropy
falls more slowly in $\mu^{im}$ but tends to converge to a lower steady-state
value. Consequently, an increase in $\mu^{in}$ causes the network to be less
ordered when $\mu^{im}$ is low, while it results in a more ordered network when
$\mu^{im}$ is high.

Property 2. When network reliability is low, the order of the social
network and the capacity for innovation are inversely related. When
network reliability is high, the order of the social network and the capacity
for innovation are positively related.

In explaining this property, we know from figure 10(A) that an increase
in μ^in induces the population to lean toward innovation relative to
imitation regardless of network reliability. As there is less social
learning, reinforcement learning implies the network is less ordered.
Examination of figure 10(B) reveals this is the case when network
reliability is low, as entropy is rising in μ^in when μ^im ≤ .35. However,
when network reliability is high, a second force comes into play, because
a higher value for μ^in also raises the value of learning from others,
since they are more likely to have discovered useful methods. Thus, the
quality of the network, in terms of agents’ having a diverse array of
useful ideas, rises with the productivity of innovation. Innovation is the
fuel for improvements in systemwide performance, as it creates variations
in the population which can be diffused among its members through the
social network. But that quality cannot be adequately tapped if
reliability is low. Thus, when μ^im is low, agents find that accessing the
network is unproductive, as it is hard to observe other agents’ ideas.
Agents then respond to being more innovative by simply substituting away
from imitation to innovation. However, when μ^im is high, agents find that
they can better access the ideas of others, and since agents are highly
innovative, there are many worthwhile ideas floating about. Increased
quality and reliability of the network then work together to result in a
more ordered network even though innovation is more productive. The key
point is that the value of imitation and the value of developing links in
the network are dependent on the innovativeness of agents. Innovation and
imitation are substitutes in search, but through the mechanism of social
learning, they are also complements.
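
A minimal sketch of how the "order" of a network could be quantified,
assuming (this section does not spell it out) that order is gauged by the
Shannon entropy of the probabilities with which an agent chooses whom to
observe. The function name and the two probability vectors are
hypothetical. Lower entropy means attention concentrated on a few links,
that is, a more ordered network.

```python
import math


def entropy(probs):
    """Shannon entropy (in nats) of a probability vector.

    Zero-probability entries contribute nothing, by the usual convention.
    """
    return -sum(p * math.log(p) for p in probs if p > 0)


# Hypothetical link-choice probabilities of one agent over four others.
unordered = [0.25, 0.25, 0.25, 0.25]  # observes everyone equally often
ordered = [0.70, 0.10, 0.10, 0.10]    # has reinforced one productive link

print(entropy(unordered), entropy(ordered))
```

Under this reading, "entropy is rising" in figure 10(B) corresponds to
agents spreading their attention more evenly across potential sources.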

Having described how the capacity for innovation impacts network
structure, let us next turn to its effect on population performance.
Figure 10(C) reports how μ^in influences the relationship between
steady-state performance, π, and network reliability, μ^im. Note that the
π curve retains the S-shape for all values of μ^in. However, the position
of the curve for varying values of μ^in exhibits an interesting pattern:
as μ^in rises, the π curve shifts up and to the right. This pattern leads
to the rather surprising result that steady-state performance can decrease
with the capacity for innovation and thus, performance can be maximized by
having moderately innovative agents.

When μ^im is sufficiently low, a population mainly improves on the basis
of individual learning through innovation because of the poor reliability
of the social network. As innovation is the primary source of improvement,
greater innovativeness enhances performance, as shown in figure 10(C). On
the other hand, for sufficiently high μ^im, the population extensively
deploys both innovation and imitation. As network reliability achieves
its maximum value, there are diminishing returns to network reliability,
so that the mix of innovation and imitation is fairly insensitive. Once
again, higher levels of innovativeness tend to produce higher performance
(though the relationship is not quite as strong as when μ^im is low).
Things get interesting for intermediate values of μ^im because the effect
of μ^in becomes more complex. To begin with, imitation is both a social
good and a social detriment. By imitating others rather than developing
new ideas, an agent is not adding to the stock of knowledge. However, by
helping to spread worthwhile ideas, an agent is improving the value of
the network, and thereby, social performance. When network reliability
is moderate, agents may be engaging in too much individual learning
from a societal perspective. It might be better to tap into the network to
pass along ideas and develop more useful links (which can only be
achieved through experience). Making agents more innovative induces
them to use the network even less, thereby exacerbating this problem. We
then find that when network reliability is moderate, greater
innovativeness can be deleterious from a societal perspective. By inducing
the population to concentrate more on diffusing local innovations across
the social system, a reduced capacity for innovation can lead to superior
performance.

Figures 11-14 explore how this nonmonotonicity of performance with
respect to innovation productivity depends on the characteristics of the
environment. The ensuing properties are summarized below.

Property 3. When the reliability of the network is sufficiently low,
performance is increasing in the capacity for innovation. When the
reliability of the network is moderate and the task environment is
sufficiently volatile (σ low and/or ρ high) and/or the individual goals of
the agents are not too dissimilar (X low and/or κ low), performance is
maximized at a moderate capacity for innovation.

The volatility in the task environment is captured by 1 − σ, the
frequency of the goal change, and ρ, the variability of the goal change.
Figure 11 shows performance as a function of μ^im for μ^in ∈ {.25, .5,
.75, 1} and σ ∈ {.75, .95, .99}. Imitation and the development of a useful
network are
more valuable when the environment is changing more frequently, because
individual learning is woefully inadequate for keeping up with these
changes. One needs the diversity of ideas of the population at large and,
as a complement, a well-developed network. It is in such a situation that
making agents more innovative will encourage them to engage in more
individual learning when the population of agents taken as a whole would
do better by engaging in more imitation and the development of better
links. We then find that, under a moderately reliable network, a greater
capacity for innovation decreases performance when the environment is
volatile (σ = .75) but enhances performance when the environment is
stable (σ = .99). For similar reasons, more innovativeness can be
deleterious when ρ is high, because the changes in the environment tend to
be bigger on average (holding fixed the frequency with which those
changes occur); see figure 12.

The impact of intergroup and intragroup goal diversity on the
relationship between innovativeness and performance is reported in figures
13 and 14. Of course, performance decreases both in X and κ, since goal
heterogeneity is deleterious (though heterogeneity in tasks is
advantageous). Speaking to property 3, the opportunities for social
learning and development of a useful network diminish as goals become more
diverse both globally and locally. Thus, the deleterious effect of a
greater capacity for innovation is less pronounced as agents rely more
upon individual learning. When individual learning is the dominant method
of search, performance rises monotonically in the ease with which new
ideas are generated.


CONCLUDING REMARKS


Progress, whether scientific, economic, or social, is driven by
innovation, which serves to produce a diversity of ideas, and by imitation
through a social network, which serves to diffuse these ideas. This study
is, to our knowledge, the first to model all three primary forces: the
discovery of new ideas, the observation and adoption of the ideas of
others, and the endogenous development of a social network. At the
individual level, innovation and imitation are substitutes, as an agent
can choose to allocate effort to discovering new ideas or to observing the
ideas of others. However, at the social level, innovation and imitation
are complements. More innovation provides a better pool of ideas that
imitation can take advantage of through a social network. In our model,
the quality of the social network is driven by three factors. The first
factor is the exogenous reliability of the communication technology, which
controls the likelihood that an agent observes another agent’s idea. The
second factor is the value of the ideas that are observed. This is the
result not only of innovation, but also of how rapidly useful ideas
diffuse, which depends on the regularity with which the network is
accessed. The third factor is the quality of the links between agents,
which is itself the product of past experience with the network, as agents
reinforce those links that have proven to be a productive source of ideas
in the past. These last two factors are not only endogenous, but also
provide an increasing returns mechanism, in that a network is of higher
quality if it is used more frequently, and agents will use a network more
frequently when it is of higher quality.

By endogenizing both the innovation-imitation mix and the structure
of the social network, our analysis has been able to uncover some subtle
determinants of the performance of a population, in particular, the role
of the agents’ capacity for innovation and the reliability of the
communication technology for conveying ideas. While a more reliable
communication technology is beneficial in that it leads to a greater
diffusion of ideas, it has the undesirable by-product of reducing
diversity in ideas as agents replace their own ideas with the most useful
ones discovered and engage in more imitation and less innovation. This
by-product is not seriously detrimental when the problems faced by agents
are relatively stable, but when they are changing sufficiently quickly,
the loss of diversity means a loss of the raw material for adaptation.
This results in performance being lower when reliability is greater. From
a social perspective, agents are engaging in too much imitation, though at
an individual level, their innovation-imitation mix is appropriate.

A second subtle result concerns how performance depends on the
capacity of agents for innovation. When that capacity is enhanced, agents
invest more effort into innovation, which actually may cause performance
of the population to decline. When network reliability is only moderately
good, it may be socially (though not individually) beneficial to engage in
more imitation. Even though this reduces the development of new ideas,
it leads to a more developed social network and thus, greater diffusion
of ideas. Making agents more innovative induces them to use the network
even less, thereby exacerbating the inadequate development of the
network. We then find that when agents endogenously determine how much
effort to put into innovation and imitation, greater innovativeness can be
deleterious from a societal perspective. By inducing the population to
concentrate more on diffusing local innovations across the social system,
a reduced capacity for innovation can lead to superior performance.

Let us conclude by discussing a few directions for future work. Though
the discovery of ideas, through either innovation or imitation, was
subject to random forces, it was assumed that the evaluation and
implementation of those ideas was flawless. Alternatively, an agent's
evaluation of an idea could be based on the true distance between an
agent's goals and the new methods vector (after adopting the idea) plus
some noise. Along similar lines, one could assume that what actually gets
implemented is the true idea plus noise. This assumption is particularly
pertinent when it comes to imitation, as one is trying to infer ideas from
what people are doing or writing, and such a process is far from flawless.
These modifications are not only a move toward greater realism; both
extensions would also serve to introduce additional diversity in the set
of methods present in the population. As our analysis showed that a
superior communication technology can be detrimental because it reduces
diversity, these factors are quite pertinent to the analysis.

On a grander level, a natural extension of our model is to allow agents
to be heterogeneous in their capacity for innovating and imitating. Some
agents are more creative and thereby more productive in generating new
ideas when they choose to engage in the act of discovery. Other agents
may be more sociable or more capable of understanding the ideas of others
and thus find it easier to learn what other agents are doing. Such
heterogeneity raises interesting questions about the properties of the
social network. To what extent does this heterogeneity lead to more order?
Are links strongest with certain types of agents? Do agents tend to form
links with those who are most innovative or those who are most imitative
(and thus may be best connected to others in the population)? Finally, in
our model, it was assumed that an agent’s goal (the characteristics of the
problem being solved) was exogenous. However, there may be societal
forces that determine what is considered to be a problem worthy of
solving. In the context of a hierarchy, problems could be mandated from
above, and in less structured contexts, imitation may occur not only with
respect to solutions, but also the problems themselves. While it is
unclear how the endogeneity of agents' goals should be modeled, it strikes
us as important. Enriching our model in these directions should generate
further
insight into the forces underlying progress and growth.


By means of extensive computer simulations, the authors consider
the entangled coevolution of actions and social structure in a new
version of a spatial Prisoner's Dilemma model that naturally gives
way to a process of social differentiation. Diverse social roles emerge
from the dynamics of the system: leaders are individuals getting a
large payoff who are imitated by a considerable fraction of the
population, conformists are unsatisfied cooperative agents that keep
cooperating, and exploiters are defectors with a payoff larger than
the average one obtained by cooperators. The dynamics generate a
social network that can have the topology of a small world network.
The network has a strong hierarchical structure in which the leaders
play an essential role in sustaining a highly cooperative stable
regime. But disruptions affecting leaders produce social crises
described as dynamical cascades that propagate through the network.


INTRODUCTION 


Social traps (Platt 1973) are situations in which rational individual
choices result in an undesirable collective outcome for the social group.
A well-known example is the problem of establishing cooperation in a
social group (Axelrod 1984) or, more generally, problems of public goods
(Olson 1965; Hardin 1968). Understanding collective social behavior, once
individual attitudes are known, requires taking into account the
interactions among the individuals of the group and acknowledging that
these interactions are mediated by a network of social relations
(Granovetter 1973, 1978). Such a network constitutes the social structure
of the group. The embeddedness of the interactions in the social structure
(Granovetter 1985) has been identified as an important factor in
explaining the evolution of cooperation (Macy and Skvoretz 1998). In the
same way that the actions of individuals are affected by the social
network, it has been documented that the network is not an exogenous
structure but is created by individual choices (Lazer 2001). However,
there are not many specific models of social dynamics that explicitly
incorporate the concept of coevolution of individual and network (Lazer
2001). In fact, in the long-term research agenda posed by Macy (1991), a
central point is that the structure of the network should not be
considered as given, but should be seen as variable. Macy poses the
question of how social structure might evolve in tandem with the
collective action it enables. This question goes beyond models in which
some network evolution is decoupled from the evolution of the actions of
the individuals in the group. In the context of reciprocal altruism and
the building of cooperation, this general question was implicitly
considered within a game theory simulation model called the social
evolution model (SEM; Zeggelink, de Vos, and Elsas 2000; de Vos,
Smaniotto, and Elsas 2001). In this article, we address the problem of
coevolution in a version of the Prisoner's Dilemma (PD; Rapoport and
Chammah 1965) in which players interact through a network that adapts to
the results of the game and therefore to the actions of the players. We
focus on the resulting type of social structure (cohesive group vs. social
hierarchy), as well as on the dynamical mechanisms needed to produce the
topological properties of the network of interactions that stabilize a
collective cooperative behavior.

An important finding from our analysis, which is extensively based on
computer simulations (Zimmermann, Eguíluz, and San Miguel 2001), is the
emergence of a process of social differentiation together with the
building-up of a network with hierarchical relations. Starting from random
partnership among equivalent individuals, a social structure emerges. In
this emerging structure, the topology of the network of social relations
identifies individuals with different social roles: leaders, conformists,
and exploiters. These roles have been spontaneously selected during the
complex adaptive evolution of the social group that entails a learning
process. This social structure sustains global cooperation, with
exploiters surviving in hierarchical chains in the network. This result is
at variance with other simulation models, such as SEM, in which partner
selection leads to the formation of cohesive egalitarian clusters
(Zeggelink et al. 2000; de Vos et al. 2001): the emergence of these groups
in a socially segmented population, with the exclusion of free riders,
seems to be the basis of the survival of cooperation in these other
studies. In considering the topology of the coevolved network that
sustains cooperation, we find a hierarchical network with an exponential
connectivity distribution. Our results show that clustering is not needed
to sustain cooperation. However, the additional inclusion of local
neighboring partner selection in the model generates the celebrated small
world connectivity (Watts 1999a) of the network.

From a philosophical point of view, the concept of emergence is used
in contemporary sociology in contradictory ways, its proper meaning being
debatable (Sawyer 2001). We use it here as it appears in multiagent models
of social systems (Gilbert and Conte 1965; Schelling 1978; Epstein and
Axtell 1996; Axelrod 1997). In this context, it refers, as is often the
case in the natural sciences, to complex dynamical behavior or properties
that cannot be reduced to or predicted from a detailed description of the
units that compose a system, so that the reductionist hypothesis does not
imply a constructionist one (Anderson 1972). These ideas put forward a
hierarchical structure of science in which different concepts and
descriptions are needed at different levels. The key idea is that
sociology cannot be reduced to psychology, as molecular biology is not
just applied chemistry (Anderson 1972). From this perspective, and
recognizing the limitations of game theory in describing human
interactions, we learn much from a metaphor like the PD when considering
emergent properties that are not simply linked to special features of
individual human behavior.

A canonical example of emergence is the V shape of bird flocks (Sawyer
2001). The shape of the flock and the fact that a particular bird plays
the role of a leader, with other birds lining up behind it, are a
nontrivial result of simple interaction rules, but the leader is neither
genetically determined nor externally appointed. Our results parallel this
example, showing how equivalent agents confronted with the choice of an
action (cooperating or defecting) and with the possibility of choosing
partners differentiate and acquire different roles while building up a
social structure. This process is the result of interactions among
neighbors that determine an entangled coevolution of the choice of actions
and of the social network. A particularly enlightening example of the
general process that we address here is the cooperative relations among
researchers in a given scientific field. A given researcher might choose
to work with or not work with another scientist and, depending on the
degree of success of this collaboration, might search in the community to
find other scientists with whom a profitable cooperative relation can
alternatively be established. This coevolution process results in a
network of collaborations in which different scientists play different
roles.

Our results might be compared with other studies of group formation
beyond those associated with the SEM. In Carley’s theory of group
stability (1991), as also happens in our simple model of equivalent
agents, individuals assume multiple roles. In addition, these diverse
roles produce, at the group level, group stability or change depending on
the social structure. However, while Carley's theory is based on the
sharing of knowledge, our model is not. Stability is in our case strictly
dependent on the adapting structure of the network of interactions.

In the remainder of this introduction, we give a brief discussion of the
main components of our analysis. These components are models of
cooperation, models of social networks, and the implications of the will
of the agents that manifests itself in the ability to make choices. The
second section of the article presents our model and some general
conclusions from its analysis. The main body of our computer simulation
results is summarized in a third section. We conclude with a discussion of
the results and limitations of our study.


Routes to Cooperation 


Why do people cooperate? Why is cooperation empirically observed when
there is a conflict between self-interest and the common good? Classical
answers to these questions are formulated in terms of kin (Hamilton 1964),
group selection (Wilson and Sober 1994), or cooperation based upon
reciprocity (Trivers 1971; Axelrod and Hamilton 1981; Axelrod 1984). The
first two answers are biologically based, the former on the increased
biological fitness of kinship and the latter on the adaptive success of
groups of cooperators. However, the idea of social kin selection has also
been put forward (Riolo, Cohen, and Axelrod 2001): cooperation can arise
from similarity, and tags that identify such similarity can be based on
cultural attributes instead of being genetically determined. Indeed,
cultural transmission has been invoked as a potential reason for the
prevalence of cooperation in human populations (Mark 2002). Another route
to cooperation, based on a reputation score for each individual and
information sharing (Nowak and Sigmund 1998), also emphasizes the cultural
forces present in society. Cooperation grounded on reputation can be seen,
however, as a form of indirect reciprocity.

Cooperation based upon reciprocity is often formalized through the
iterated PD (Rapoport and Chammah 1965). The game theoretical formulation
of the PD shows that two perfectly rational agents interacting once and
confronted with the choice of cooperating or defecting would both choose
to defect, which corresponds to the Nash equilibrium of the game. When the
game is repeated or iterated, the Folk theorem classifies the many
possible outcomes that can be sustained, particularly full cooperation. It
is the shadow of the future in the repeated interaction that makes
cooperation sustainable (Axelrod 1984). Although basic concepts and
results on the PD were well known in game theory (Binmore 1998), the
evolutionary ideas for the selection of an equilibrium pioneered by
Axelrod (1984, 1997) have been extremely influential. They have led to
the consideration of the virtue of different strategies and to the concept
of evolutionarily stable strategies. It is now well accepted that
dynamical models of cultural evolution and social learning hold a greater
chance for success than models merely based on rational choice. This body
of knowledge has been reviewed by Hoffmann (2000), while some recent
advances in the theory have been reviewed by Axelrod (2000).
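
The one-shot argument can be checked by brute force. The sketch below is
only an illustration: the payoff values (T = 5, R = 3, P = 1, S = 0) are
assumed, since the text gives no numbers, and the helper names are
hypothetical. Any payoffs with T > R > P > S give the same conclusion.

```python
# Illustrative payoffs (assumed): temptation > reward > punishment > sucker.
T, R, P, S = 5, 3, 1, 0

# PAYOFF[(my_action, other_action)] is my payoff; "C" cooperate, "D" defect.
PAYOFF = {("C", "C"): R, ("C", "D"): S, ("D", "C"): T, ("D", "D"): P}
ACTIONS = ("C", "D")


def best_response(other_action):
    """Action that maximizes my payoff against a fixed opponent action."""
    return max(ACTIONS, key=lambda a: PAYOFF[(a, other_action)])


def nash_equilibria():
    """Pure-strategy profiles where each action is a best response to the other.

    The game is symmetric, so one payoff table serves both players.
    """
    return [(a, b) for a in ACTIONS for b in ACTIONS
            if best_response(b) == a and best_response(a) == b]


print(nash_equilibria())  # [('D', 'D')]
```

Defection is the best response to either action, so mutual defection is the
only equilibrium even though mutual cooperation pays each player more
(R > P). This is the social trap that repetition can relax.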

Generally speaking, locality is not taken into account in the different
studies of models of cooperation mentioned so far. Individuals or players
interact globally through continuous changes of random pairings among
them. However, local spatial interactions introduce another possibility of
reciprocity that is not based on history-dependent or backward-looking
strategies. In this line of thinking, spatial PD games have been
introduced (Axelrod 1984; Nowak and May 1992) and further generalized by
Lindgren and Nordahl (1994). Spatial games and local interactions were
also introduced in economic contexts by Blume (1993) and Ellison (1993).
Nowak and May (1992, 1993) and Nowak, Bonhoeffer, and May (1994a, 1994b)
started by considering a group of individuals placed at the nodes of a
regular two-dimensional square lattice. What they showed is that
cooperation might arise from spatially distributed interactions: if agents
play only with a local neighborhood in the lattice, clusters of
cooperators may survive the interaction with defectors, and cooperation
may be sustained. This can happen by bypassing any consideration of memory
and strategy, in situations when noncooperative behavior would prevail in
the global game. Admittedly, the game theory formulation of cooperation
neglects interpersonal relations driven by emotional processes (Lawler and
Yoon 1998). In spatial PD games, the fundamental relationship is that of
exchange conditioned by structural position. Still, the basic mechanism
of imitating successful neighbors is introduced. This mechanism is
certainly prevalent in many human interactions. It must be pointed out
that the analysis of spatial games, especially when searching for
collective emergent behavior in societies with a large number of
individuals, requires the use of intensive computer simulations. Some
analytical results, however, have been obtained (Schweitzer, Behera, and
Muhlenbein 2002), especially in simplified one-dimensional models (Eshel,
Samuelson, and Shaked 1998).
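
The mechanism of imitating successful neighbors can be sketched in a few
lines. What follows is a simplified rendering in the spirit of the lattice
games described above, not the exact specification of any cited study. The
assumptions are ours: four nearest neighbors on a periodic square lattice,
a payoff of 1 for mutual cooperation and b for defecting against a
cooperator (0 otherwise), synchronous updating, and a rule that an agent
changes strategy only when a neighbor earned strictly more. The names
`step` and `b` are illustrative.

```python
def step(grid, b):
    """One synchronous round: play all neighbors, then imitate the best one.

    grid[i][j] is 1 for a cooperator and 0 for a defector.
    """
    n = len(grid)

    def neighbors(i, j):
        # Four nearest neighbors with periodic (wrap-around) boundaries.
        return [((i - 1) % n, j), ((i + 1) % n, j),
                (i, (j - 1) % n), (i, (j + 1) % n)]

    # Accumulate each agent's payoff over games with its neighbors.
    payoff = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            for k, l in neighbors(i, j):
                if grid[k][l] == 1:  # only a cooperating partner yields payoff
                    payoff[i][j] += 1.0 if grid[i][j] == 1 else b

    # Each agent copies the strategy of the highest earner nearby,
    # keeping its own strategy unless a neighbor did strictly better.
    new_grid = [[0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            best_i, best_j = i, j
            for k, l in neighbors(i, j):
                if payoff[k][l] > payoff[best_i][best_j]:
                    best_i, best_j = k, l
            new_grid[i][j] = grid[best_i][best_j]
    return new_grid


# A single defector in a 5 x 5 population of cooperators, with b = 1.5.
grid = [[1] * 5 for _ in range(5)]
grid[2][2] = 0
after = step(grid, 1.5)
print(sum(map(sum, after)))  # 20 cooperators remain after one round
```

In this example the lone defector earns 6, its four neighbors earn 3 each,
and all other agents earn 4, so only the four adjacent cooperators switch.
Whether defection keeps spreading or cooperative clusters hold depends on b
and on the initial configuration, which is why simulation is the usual tool.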

There are other routes to cooperation, among which we can mention
optional participation (Hauert et al. 2002) and stochastic collusion (Macy
1991). Optional or volunteer participation is a mechanism that
incorporates, in addition to cooperators and defectors, players (loners)
that refuse to participate in the game. This has been shown to be an
effective mechanism to escape from the social dilemma without invoking any
form of reciprocity. When locality and spatial interactions are
incorporated, cooperators also tend to fare better (Szabo and Hauert
2002). Stochastic collusion stems from a forward-looking route to learning
and adaptive behavior (Macy and Flache 2002): stochastic search by
adaptive individuals in response to immediate outcomes allows escape from
the noncooperative social trap. The beneficial implications of locality
(bounded social space) for sustaining cooperation are another component of
this mechanism (Macy 1991).


Social Networks 


A new field of opportunities for modeling is opened when considering the
network of social interactions: links between individuals are not
randomly established, but are dependent on neighborhood or interest
relationships. As a consequence, clusters and stable linkages are
developed, giving way to new aspects of cooperation in which interactions
among individuals have an important effect, as they are crucial to the
performance of the system as a whole (Holland et al. 1986).

The consideration of the spatial version of the PD discussed above is
a step in the direction of considering a social network. But a social
network is different from a regular two-dimensional lattice, just as
social interactions are different from a continuous random pairing of
individuals. The popular phenomenon of six degrees of separation (Guare
1990) that follows from the early experiments of Milgram (1967)
demonstrates that if the average separation between two individuals is
given by only six intermediate acquaintances (six links), social networks
radically differ from the regular ones often considered in spatial games.
Such a small world effect has been repopularized by Watts and Strogatz
(1998). Beyond the physical and topological properties of the network, it
is also important to recall that, in economic terms, a social network has
been associated with a social capital (Coleman 1988), in the sense that it
gives the basis for stable, repeated social interactions.

The spatial version of the PD has been revisited by Axelrod and
coworkers (Axelrod 2000; Cohen, Riolo, and Axelrod 2001) in trying to
understand the role of social structure and geographically based networks
in the maintenance of cooperation. They have shown that geographically
dispersed social networks are efficient in maintaining cooperation,
provided the links are stable. They conclude, therefore, that the
important aspect is not clustering (i.e., locality or correlation of
linkage patterns), but what they label as context preservation, that is,
the continuity of interactions. Another interesting contribution in this
context is the study of Buskens and Weesie (2000) on the role of social
structure in supporting cooperation via reputation. By including a social
structure, this study goes beyond those of Nowak and May (1992) and Riolo
et al. (2001), and it shows that these two aspects, social structure and
reputation, reinforce each other in building up a cooperative collective
behavior (Axelrod 2000). Reputation established via information sharing
leads to sustained cooperation in social structures that are less rigid
than the ones fixed by geographic positions, therefore adding a higher
degree of freedom.

Since the early mathematical analysis of random networks (Erdős and
Rényi 1959), there has been a fast-growing literature devoted to
uncovering the topological properties of technological, biological, and
social networks (Wasserman and Faust 1984; Watts and Strogatz 1998; Watts
1999a, 1999b; Barabási and Albert 1999; Amaral et al. 2000; Newman 2001;
Newman, Strogatz, and Watts 2001), partially reviewed by Albert and
Barabási (2002), Barabási (2002), and Watts (2003). Strogatz (2001)
emphasized that although many networks present similar characteristics, it
is important to note that differences do exist. The properties of a
network that are usually analyzed are average distance between nodes,
clustering, and degree distribution. A short average distance, that is,
the fact that the number of steps needed to connect any pair of
individuals in a social network is small, is what originally characterized
a small world effect. But it was long ago recognized (Wasserman and Faust
1984) that social networks also show a high tendency to form cliques, that
is, that friends of friends are also friends. Such a high clustering
property and a small path length characterize the small world networks
formalized by Watts (1999a, 1999b). These small world networks are halfway
between random and regular networks. The degree distribution is the
probability distribution of the number of links or connections of a node
of the network. Many networks have a power-law distribution, which implies
a scale-free distribution of degree (Barabási and Albert 1999). However,
it has been reported (Amaral et al. 2000) that social networks do not
generally display such heterogeneity in their connectivity, but instead
show a more homogeneous degree distribution, with the number of
connections being characterized by a narrow distribution around a single
scale.
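
The three properties named above can be computed directly for a small
undirected network. The sketch below is a plain illustration using only
the standard library; the function names and the two toy networks are
hypothetical, and the average distance is taken over reachable ordered
pairs only, which is one of several possible conventions.

```python
from collections import Counter, deque


def degree_distribution(adj):
    """Fraction of nodes having each degree; adj maps node -> set of neighbors."""
    counts = Counter(len(nbrs) for nbrs in adj.values())
    return {k: c / len(adj) for k, c in sorted(counts.items())}


def clustering(adj, v):
    """Share of pairs of v's neighbors that are themselves linked."""
    nbrs = list(adj[v])
    k = len(nbrs)
    if k < 2:
        return 0.0
    links = sum(1 for a in range(k) for b in range(a + 1, k)
                if nbrs[b] in adj[nbrs[a]])
    return 2.0 * links / (k * (k - 1))


def average_clustering(adj):
    """Mean of the local clustering coefficients over all nodes."""
    return sum(clustering(adj, v) for v in adj) / len(adj)


def average_distance(adj):
    """Mean shortest-path length over reachable ordered pairs (breadth-first)."""
    total, pairs = 0, 0
    for source in adj:
        dist = {source: 0}
        queue = deque([source])
        while queue:
            u = queue.popleft()
            for w in adj[u]:
                if w not in dist:
                    dist[w] = dist[u] + 1
                    queue.append(w)
        total += sum(dist.values())
        pairs += len(dist) - 1
    return total / pairs


# Toy networks: a six-node ring (regular, no triangles) and a triangle (a clique).
ring = {i: {(i - 1) % 6, (i + 1) % 6} for i in range(6)}
triangle = {0: {1, 2}, 1: {0, 2}, 2: {0, 1}}
print(average_distance(ring), average_clustering(ring), degree_distribution(ring))
print(average_distance(triangle), average_clustering(triangle))
```

The ring has no clustering and an average distance that grows with its
size, while the clique has full clustering and distance one. A small world
network in the sense used here keeps clustering high while the average
distance stays short.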

Our concern in this article is not the characterization of the topological
properties obtained in examining a snapshot of a social network. Rather,
the question addressed is how the network is dynamically formed or how
a given network structure is reached after social agents interact for a
long time (Zimmermann, Eguíluz, and San Miguel 2004). As Watts indicates
(Watts 1999a), networks affect the dynamics of the system in a passive
and an active way. Examples of the passive way are the spatial version
of the PD game and the variants thereof previously mentioned, or the
study of a PD game in a small world network (Abramson and Kuperman
2001). We are here interested in the active manner in which the network
of connections evolves by the will of the agents. We seek to unveil
possible dynamic mechanisms to achieve a small world connectivity.


Making Choices and Beyond: Spontaneous Social Differentiation


Incorporating the will of the agents seems to be a crucial component of
any model of human behavior. In the setting of a spatial version of the
PD game in a social network, the agents should have some way of choosing
their actions (cooperate or defect) and choosing their partners. Making
choices is an instrument for learning in an adaptive evolution.

Partner and action selection has been taken into account in the iterated
PD as reviewed by de Vos et al. (2001), but without considering local
spatial interactions and also without reference to a social network. The
volunteering mechanism of Hauert et al. (2002) does not give a choice of
action, since agents refusing to participate are fixed from the outset, but
it is an indirect mechanism to determine who interacts with whom. More
closely related to changes in the network structure, or to the choice of
partners, is the mechanism of allowing players to exit from an
unsatisfactory relationship with partners, as discussed by Axelrod (2000)
on the grounds of the EdK-Group’s analysis of a set of 15 strategies
related by the possibility of opting out (EdK-Group 2000). The possibility
of exiting generally reinforces cooperation, but again, this study does not
include local interactions and social networks.

The formation of a social network based on individual decisions has
been studied by Bala and Goyal (2000); however, the individuals forming
the network do not have dynamics on top of the network of interactions:
there is no game mediated by the interactions defined by the network.
Skyrms and Pemantle (2000) do consider the simultaneous evolution of
actions and the structure of the network (fluid network), emphasizing that
these types of models are largely unexplored. A generic result from their
study is that important changes in global behavior occur when moving
from frozen to fluid social networks. Their approach is similar to the one
also considered by Macy (1991): starting from a random network, the
network is regenerated at each time step of the game by random pairings
among the agents, but the probability of a pairing depends on the previous
results of the game. This is a basic idea of coevolution, but it does not
take into account aspects of locality, or a social network that, albeit
evolving, has well-established, stable links among individuals.

Our starting point is the observation that the influence of social
structure in human behavior, with feedback between actions and social
structure, might be seen as an entangled coevolution of the choice of
actions and the formation of a social structure. Specifically, we address
here the issue of the evolution of social networks in the context of
cooperation, allowing individuals both to choose actions and to exit from
unsatisfactory relationships. We propose a simple model where the actions
of individuals that form the social network are driven by their level of
satisfaction, as they choose their actions by imitation of their best
neighbor. Moreover, we introduce social plasticity (other definitions are
introduced by Lazer [2001]) as the capacity of the individuals to choose
partners, being able to change their neighborhood as time goes on. Evolving
interactions thus appear, allowing individuals to choose different partners
on the grounds of the obtained performance.

The significant step forward in the results of our approach is that it
naturally leads to a process of social differentiation. Starting from random
partnership among equivalent individuals, a social structure emerges. In
this emerging structure, the topology of the network of social relations
identifies individuals with different social roles. These roles have been
spontaneously selected during the complex adaptive dynamics of the social
group. They are not the direct consequence of initial differences in
strategy or location. Rather, they emerge in a probabilistic dynamic in
which the social network is constructed.


PRISONER'S DILEMMA IN AN ADAPTIVE NETWORK 


The simplest form of the PD game consists of two agents that may choose
between two actions: to cooperate (C) or to defect (D). If both agents
choose C, each agent gets a payoff (reward) R; if one defects while the
other cooperates, the former gets a payoff T (with T > R), while the latter
gets the “sucker’s” payoff S (with S < R); if both defect, both get a payoff
P. Under the standard restrictions T > R > P > S and T + S < 2R, defection
is the best choice in a single-shot game (Nash equilibrium). Thus, the
social dilemma of how cooperation may be sustained arises, due to the
fact that rational agents would defect. When no social structure is
assumed, agents are drawn randomly in pairs to play the game, and the
dynamics may be described by a replicator-type equation (Hofbauer and
Sigmund 1998). This equation can be understood from the biologically
motivated fact that strategies with fitness greater than the average in the
whole population replicate with a positive rate, while those that
underperform compared to the average die away. In our context, if $\Pi_C$
is the average payoff obtained using strategy C, and $\langle \Pi \rangle$
the average payoff for all the population, then the time evolution of the
fraction of the agents using strategy C, labeled $f_C$, obeys the equation

$$\frac{df_C}{dt} = f_C \left( \Pi_C - \langle \Pi \rangle \right). \qquad (1)$$

Given the payoffs of the PD game we obtain

$$\Pi_C = R f_C + S (1 - f_C), \qquad (2)$$

$$\langle \Pi \rangle = R f_C^2 + (S + T) f_C (1 - f_C) + P (1 - f_C)^2, \qquad (3)$$

and thus

$$\frac{df_C}{dt} = f_C (1 - f_C) \left[ (R - T) f_C + (S - P)(1 - f_C) \right]. \qquad (4)$$

The analysis of this equation indicates that it has two equilibrium
solutions in $f_C \in [0,1]$: $f_C = 0$ is a stable fixed point, while
$f_C = 1$ is an unstable fixed point, because R < T and S < P make the
bracket in equation (4) negative for every $f_C$ in that interval. Thus, it
also follows from this dynamical analysis that in the absence of an
interaction network describing a social structure, the only stable solution
is a pure defective state.
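
The fixed-point statement can be checked numerically. The following is a minimal sketch, not part of the original analysis: it integrates equation (4) with a forward-Euler scheme. The function names, the step size, and the payoff values T = 1.5 and P = 0.1 are illustrative assumptions chosen only to satisfy T > R > P > S.

```python
def replicator_rhs(f_c, R=1.0, S=0.0, T=1.5, P=0.1):
    """Right-hand side of eq. (4): time derivative of the cooperator fraction.

    The default payoffs are illustrative assumptions (T > R > P > S).
    """
    return f_c * (1.0 - f_c) * ((R - T) * f_c + (S - P) * (1.0 - f_c))


def integrate(f0, dt=0.01, steps=20000, **payoffs):
    """Forward-Euler integration of eq. (4) starting from the fraction f0."""
    f = f0
    for _ in range(steps):
        f += dt * replicator_rhs(f, **payoffs)
    return f


if __name__ == "__main__":
    # Any interior starting point should drift toward f_C = 0.
    for f0 in (0.1, 0.5, 0.99):
        print(f"f_C(0) = {f0:4.2f}  ->  f_C(end) = {integrate(f0):.2e}")
```

Even a population that starts with 99% cooperators ends up in the all-defector state under these assumed payoffs, which is the behavior the fixed-point analysis predicts for a well-mixed population.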

In the spatial version of the PD (Axelrod 1984; Nowak and May 1992,
1993; Nowak et al. 1994a, 1994b), agents sit on the nodes of the network
and they play the game with their neighbors. Two agents are said to be
“neighbors” if they are directly connected by a link of the network. A
detailed analysis of the different spatiotemporal patterns that can arise
in the dynamics shows a phase space where cooperators and defectors
can coexist depending on the parameter values (Schweitzer et al. 2002;
Lindgren and Nordahl 1994). There are threshold values in parameter
space beyond which defection dominates the whole network.

We will consider a spatial version of the PD, generalized in order to
consider adapting, evolving networks of interaction instead of fixed,
regular lattices.


Social Plasticity 


We want to study how the results obtained with fixed interaction networks
change if a certain social plasticity is allowed. This term addresses the
common observation that social interactions in most societies adapt in
time by a learning process. We will implement a rule for social plasticity
that depends on both the strategies and payoffs of the individual players.
The motivation of this rule comes from the following analysis of the
mutual benefit obtained from each of the allowed PD interactions: two
symmetric (C-C and D-D) and one asymmetric (C-D). Clearly, the
symmetric interaction C-C should be reinforced because both agents get the
maximum payoff from their selected strategies. However, the opposite
occurs in a D-D interaction, where both agents have aligned incentives
to change neighbors and to possibly find a C neighbor. The asymmetric
interaction C-D forms an intermediate class, where the C agent will not
support the interaction, but the D agent will try to reinforce it. As a first
approximation to the problem, we assume that this type of interaction
does not change, because the overall effect of both agents is balanced.
From this analysis, we conclude that the simplest nontrivial social
plasticity rule should allow interactions between two D agents to adapt,
providing a self-interest mechanism of social adaptation where the agents
can increase their payoffs by changing their partners.


The Model 


We consider a fixed population of agents placed in the nodes of a network
and connected by links to their neighbors. The dynamics evolve in discrete
time steps divided in three stages, starting from a random choice of
strategies and a random network.


1. Interacting. Each agent plays the PD game with its neighbors
   and collects an aggregate payoff, $\Pi$.

2. Strategy update. Each agent updates its current strategy,
   imitating the strategy of the agent with the largest payoff in its
   neighborhood, itself included (whenever more than one equivalent
   neighbor with a larger payoff exists, one of them is randomly
   selected).

3. Neighborhood update. If agent i imitates a defector, then agent i
   replaces, with probability p, its link with the imitated D neighbor
   by a new one pointing to a randomly chosen partner from the whole
   network. This process updates the network.


In order to clarify the rules of the model, we would like to make the
following points. First, we specify that in the interaction step, we only
consider bidirectional (undirected) links, so if agent i plays with j, then
we also assume that j plays with i. Second, we do not consider complex
strategies which involve the history of past encounters with neighbors (as
in Lindgren and Nordahl 1994), but we consider instead only zero-memory
strategies, and assume that each agent plays the same strategy (action) C
or D with all its partners. Third, the game is played synchronously, that
is, at each discrete time step, agents decide their strategy in advance and
they all play at the same time.
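
The three stages can be sketched in a few lines of Python. This is a minimal illustration, not the code used for the simulations reported in this article. The function names, the dictionary-of-sets representation of the network, the tie-breaking (uniform among all agents sharing the largest payoff, the agent itself included), and the exclusion of self-links and duplicate links when a new partner is drawn are all assumptions made for the sketch.

```python
import random

R, S, P = 1.0, 0.0, 0.0  # payoffs kept fixed; T = b is the incentive to defect


def payoffs(strategy, nbrs, b):
    """Stage 1: aggregate payoff of each agent against all of its neighbors."""
    table = {("C", "C"): R, ("C", "D"): S, ("D", "C"): b, ("D", "D"): P}
    return {i: sum(table[strategy[i], strategy[j]] for j in nbrs[i]) for i in nbrs}


def step(strategy, nbrs, b, p, rng=random):
    """One synchronous time step. Mutates nbrs and returns the new strategies."""
    pay = payoffs(strategy, nbrs, b)
    new_strategy, to_rewire = {}, []
    for i in nbrs:
        # Stage 2: imitate the best-paid agent of the neighborhood, self included.
        best = max(pay[j] for j in nbrs[i] | {i})
        tied = [j for j in [i] + sorted(nbrs[i]) if pay[j] == best]
        model = rng.choice(tied)  # assumed tie rule: uniform among tied agents
        new_strategy[i] = strategy[model]
        if model != i and strategy[model] == "D" and rng.random() < p:
            to_rewire.append((i, model))
    # Stage 3: swap the link to the imitated defector for a random new partner.
    for i, model in to_rewire:
        candidates = [k for k in nbrs if k != i and k not in nbrs[i]]
        if model not in nbrs[i] or not candidates:
            continue  # link already removed this step, or nobody left to link to
        new = rng.choice(candidates)
        nbrs[i].discard(model)
        nbrs[model].discard(i)
        nbrs[i].add(new)
        nbrs[new].add(i)
    return new_strategy
```

Strategies are all computed from the payoffs of the current step before any link is changed, which is one way of honoring the synchronous update. Each rewiring removes one link and adds one, so the total number of links is conserved.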

We consider in figure 1 an example that illustrates the rules of the
model. The system is initially formed by five cooperators (agents a to e)
and one defector (agent f). According to the payoff they obtain, agents a,
d, and e are unsatisfied, and they will imitate in the next time step the
cooperator c. Agents c and f are satisfied because they do not have a
neighbor with a larger payoff. Finally, agent b is also unsatisfied, but the
agent with the largest payoff in its neighborhood is the defector f. Thus
in the next time step, it changes its strategy to become a defector and
(with probability p) breaks the link with agent f and selects a new
neighbor as partner (in this case it selects agent a). Note that in the
neighborhood update rule, it is not necessary for an agent to change its
strategy from cooperator to defector in order to sever a link. This example
shows that the neighborhood update rule facilitates the survival of the
defectors by increasing their payoff when a cooperator is selected at
random.

FIG. 1. (a) At time step t, the system is composed of cooperators (white
circles a to e) and defectors (black circle f) whose interactions are given
by the links. Since the payoff matrix is given by P = S = 0, R = 1, and
T = 1.3, the payoffs of agents a to f are {1, 2, 3, 2, 2, 2.6}. Therefore,
the cooperator c is leader, because it has the maximum payoff. Agents a and
b get a lower payoff than their common neighbor, the defector f. However,
agent a has another neighbor c that gets an even larger payoff than f.
(b) In the next time step t + 1, a imitates cooperation from c, which has
the largest payoff among its neighbors, and only agent b changes strategy
and imitates the D strategy from f. The figure also shows that b broke the
link with agent f and started a new interaction with the randomly selected
agent a.
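
The payoffs quoted for figure 1 can be reproduced by hand or with a short script. Since the drawing is not reproduced here, the link list below is an assumption: it is one topology consistent with the payoffs {1, 2, 3, 2, 2, 2.6} and with the imitation pattern described in the text. The variable names are likewise illustrative.

```python
R, S, T, P = 1.0, 0.0, 1.3, 0.0
PAYOFF = {("C", "C"): R, ("C", "D"): S, ("D", "C"): T, ("D", "D"): P}

# Assumed topology, inferred from the stated payoffs (not read off the figure).
links = [("a", "c"), ("a", "f"), ("b", "f"), ("b", "d"),
         ("b", "e"), ("c", "d"), ("c", "e")]
strategy = {"a": "C", "b": "C", "c": "C", "d": "C", "e": "C", "f": "D"}

nbrs = {k: set() for k in strategy}
for u, v in links:
    nbrs[u].add(v)
    nbrs[v].add(u)

# Aggregate payoff: one PD game with every neighbor.
payoff = {i: sum(PAYOFF[strategy[i], strategy[j]] for j in nbrs[i]) for i in nbrs}

# Best-paid agent of each neighborhood, the agent itself included.
best = {i: max(sorted(nbrs[i] | {i}), key=payoff.get) for i in nbrs}
satisfied = sorted(i for i in nbrs if best[i] == i)

if __name__ == "__main__":
    for i in sorted(nbrs):
        print(i, strategy[i], round(payoff[i], 2), "imitates", best[i])
    print("satisfied agents:", satisfied)
```

Under this assumed topology, agent f earns 2 x 1.3 = 2.6 from its two cooperating neighbors, agent c earns 3 from three cooperators, and agent b, whose cooperating neighbors earn only 2 each, is the single agent whose best neighbor is the defector.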

The strategy and neighborhood update rules implemented in this model
represent a useful device to unveil basic mechanisms behind global
cooperative behavior. Other devices are discussed later in section 4. On
the one hand, the extreme learning (imitation) rule of the strategy update
can be justified by the psychological bias to focus on confirmation and
neglect disconfirmation of beliefs (Strang and Macy 2001). Bounded rational
agents seek to learn from limited and biased information: they respond
to perceived failure by imitating their most successful peer. In our example
of scientific collaborations, each scientist takes as a model the most
successful among his collaborators rather than, for example, alternating
between imitating both successful and not-so-successful collaborators.

Regarding the neighborhood update rule, we assume first that only D-D
links are broken, and second that any targeted agent always accepts
a new partner. The first assumption can be justified as a conservative one
when considering minimal conditions for the emergence of cooperation.
As discussed in section 4, alternative rules allowing cooperators to sever
their links would further enhance cooperation. The second assumption
can be justified by invoking the absence of cost to sustain a link. Within
this assumption, the new link may only increase the payoff, never decrease
it, and thus it will always be accepted. In the context of scientific
collaboration, if the cost to sustain a research collaboration were zero,
then scientists would accept offers from any collaborator with the
expectation of getting benefit. Beyond sustaining a link, there is a cost
to establish a new one. In our model, the plasticity parameter p measures
how easy it is both to sever a collaboration and to find a new partner.
While for p = 0 this cost is extremely large (so large that it prevents the
formation of new links), for p = 1 the cost is small.

Finally, although in real social dynamics one experiences both
spontaneous creation and suppression of interactions, for simplicity, our
model assumes that unsuccessful relations are replaced by randomly selected
new ones, always leaving the total number of links constant.

Our strategy update introduces the first classification of the agents in
the network: satisfied and unsatisfied agents. An agent is satisfied if it
does not imitate any other agent, thus having the largest payoff in its
neighborhood; otherwise it is unsatisfied. Thus, strategy imitation and
social plasticity (adaptation) are restricted to unsatisfied agents.

The probability p measures the social plasticity of the agents, controlling
the rate at which the network structure evolves, as compared to the
timescale of evolution of the strategies. For values of p ≪ 1, strategies
change much faster than the network evolves (a situation similar to the
frozen network of p = 0), while for p = 1, strategies and network evolve
at the same rate (fluid social network).


Stationary States 


The proposed model naturally leads to a time evolution of the local
connectivity of the network, featuring agents with heterogeneous
neighborhoods. Starting from an initial condition of random partnership
with links randomly placed among initially equivalent agents, the feedback
between choice of strategies and choice of partners results in a dynamical
evolution from which a well-defined social structure is expected to emerge
in the form of stationary states.

In our context, a stationary state is reached when both the strategy of
each agent and its neighbors remain fixed in time. Given the discrete
nature of the dynamics and the finite number of possible states, these
states may be reached in a finite number of time steps. The simplest
stationary state consists of all agents being cooperators. Note, however,
that a state composed exclusively of defectors is a stationary state only
when the network is fixed (p = 0); for p > 0, the agents’ strategies remain
fixed, while the network is continuously evolving. According to the
dynamical rules of the model, each agent’s being a defector is not a
stationary configuration, even in the case of a social network where each
agent has the same number of neighbors (remember that in the case of
several neighbors sharing the largest payoff in the neighborhood, one of
them is picked at random). In an all-D network, only interaction links
change, but not strategies. For the sake of clarity, we will refer to this
state as the all-D network state, even if the interactions are not
stationary. Our system has a multiplicity of different stationary states,
and the system is expected to reach one of these states. This assumption is
confirmed by numerical simulations. The specific stationary state reached
by the system depends on the stochastic process that shapes the network
evolution. A proper characterization of the general properties of these
stationary states relies then on statistical tools.

Two requirements are needed for a stationary state to exist. The first
is that there be no links between two defectors, so that the neighborhood
update rule (p > 0) does not produce any change. The second condition
relates the respective payoffs in each neighborhood of any cooperator i
interacting with a D agent, say d. Their respective payoffs must satisfy

$$\Pi_j > \Pi_d > \Pi_i, \qquad (5)$$

where agent j is the cooperator imitated by i. Thus, a stable situation
occurs when the payoff of defectors is accommodated “in between” the
payoff of two cooperators, and they only exploit cooperators. Whenever
the payoff of d becomes larger than that of the best neighbor of i, the
configuration is no longer stationary, because the cooperator i will switch
to imitate the strategy of agent d. Thus, in the stationary state, defectors
do not interact with other D agents, and no cooperator imitates their
strategy. An interesting consequence of this result is that the agent with
the largest payoff is a cooperator if the system settles to a stationary
state. However, this does not prevent a defector from holding the largest
payoff in a transient state.
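
The two requirements translate directly into a test that can be applied to any configuration. The sketch below is illustrative only: it reads condition (5) as $\Pi_j > \Pi_d > \Pi_i$, with j the best-paid cooperating neighbor of i, and the function names and the dictionary-of-sets network representation are assumptions.

```python
def payoffs(strategy, nbrs, b, R=1.0, S=0.0, P=0.0):
    """Aggregate PD payoff of each agent against all of its neighbors."""
    table = {("C", "C"): R, ("C", "D"): S, ("D", "C"): b, ("D", "D"): P}
    return {i: sum(table[strategy[i], strategy[j]] for j in nbrs[i]) for i in nbrs}


def is_stationary(strategy, nbrs, b):
    """True if no D-D link exists and every C-D link satisfies condition (5)."""
    pay = payoffs(strategy, nbrs, b)
    for i in nbrs:
        for d in nbrs[i]:
            if strategy[d] != "D":
                continue
            if strategy[i] == "D":
                return False  # requirement 1: a D-D link would be rewired
            # Best-paid cooperating neighbor j of the cooperator i.
            best_c = max((pay[j] for j in nbrs[i] if strategy[j] == "C"),
                         default=float("-inf"))
            if not (best_c > pay[d] > pay[i]):
                return False  # requirement 2: condition (5) is violated
    return True


if __name__ == "__main__":
    # Hypothetical configuration: cooperator j leads i, x1, x2; d exploits i.
    nbrs = {"j": {"i", "x1", "x2"}, "i": {"j", "d"},
            "x1": {"j"}, "x2": {"j"}, "d": {"i"}}
    strategy = {"j": "C", "i": "C", "x1": "C", "x2": "C", "d": "D"}
    print(is_stationary(strategy, nbrs, b=1.5))
```

In the hypothetical configuration the payoffs are 3 for j, 1.5 for d, and 1 for i, so the defector sits between the two cooperators and the state passes the test. A state of cooperators only passes trivially, while any state containing a D-D link fails.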

This simple analysis highlights the role that the “imitation of the best
neighbor,” coupled to the social plasticity rule, has in shaping the social
structure of the system. A related consequence is that the social structure
developed becomes hierarchical in terms of payoff and dynamics, as we
will show below. One way to visualize the hierarchical structure of the
network is by constructing a subnetwork referred to as an imitation
network, where each directed link indicates who is imitating whom. Since
in stationary states no other agent imitates defectors, D agents are
isolated in this representation.

Figure 2 shows part of the imitation network in a stationary
configuration. The agents in the outer layers imitate the ones in the inner
layers; by our previous definition they are unsatisfied agents. In a
stationary state, cooperators form trees of agents hierarchically ordered
according to their payoff. This description highlights some special agents
that do not imitate any other agent: the agent at the top of the chain has
the highest payoff of the tree to which it belongs, and it is the only agent
that is satisfied in the tree. Such agents are easily recognized at the top
of the trees. All other agents in the tree are unsatisfied, but they imitate
the same strategy they were playing in the previous time step. An
implication of this representation is that the size of the group formed by
following directed links to the leader gives an indication of the influence
of the leader with respect to the rest of the system. In a stationary state,
a leader has no links to D agents. Among the leaders, there is an absolute
leader defined as the one with the largest payoff in the system. As a
consequence, the absolute leader is also the agent with the largest number
of links in the imitation network. All the links of the absolute leader are
imitation links shown in the imitation network; any other leader in the
system has a smaller number of cooperating partners. The whole system is
dominated by the few leaders that control a large fraction of the
population. The structure of the social network is then expected to be very
sensitive to perturbations acting on those leaders.

When the system is close to a stationary state (i.e., if there are a few
cooperators not satisfying eq. [5], as happens when, for example, a
cooperator imitates a defector), then the imitation network is useful in
understanding how this “perturbation” will propagate along the rest of the
social structure. Note that at each time step, the D strategy replicates on
all those agents connected to the agent where the perturbation started,
causing an “avalanche” of replication events. This chain of events may
end in an all-D network, or, because of the existence of cooperator leaders,
cooperation may recover, as we will show below.


Social Differentiation 


FIG. 2. Imitation network obtained from a numerical simulation with
T = 1.75, R = 1, P = S = 0, and p = 0.10. The simulation starts from a
random network with N = 10,000 agents and a number K = 8 of average links
per node of the network. Agents are organized in imitation layers coded with
the same gray level. Starting from the outer layer of a tree, new nodes are
introduced for each new link in the path to reach the leader at the top of
the tree. Each agent imitates the agent of the inner layer to which it is
linked in the tree. The network contains several trees. Three leaders (L1,
L2, and L3) are identified. L1 is the absolute leader with the largest
payoff in the system. Its tree contains approximately half the number of
agents of the system. All nodes are cooperators, because defectors are
isolated in the imitation network. Most of the network’s nodes, which
belong to the lowest layer of the hierarchical structure, are not shown for
clarity.

Our previous description of the resulting stationary states also offers a
description of how our social model with network adaptation naturally
gives rise to a process of social differentiation with the spontaneous
emergence of different social roles. Even if all the agents are driven by
the same dynamical rules and they are initially statistically equivalent,
their role in the network diversifies. Our previous analysis of the
imitation network allows us to identify three types of agents in a
stationary state:

1. Leaders. Satisfied cooperators that have the maximum payoff in their
   corresponding neighborhood. The absolute leader is the agent with the
   largest payoff in the whole network, and its corresponding group of
   influenced agents is in general the largest. Typically, the other leaders
   also have a large number of links and a payoff above the average.
   Admittedly, the sociological concept of leader has several different
   characteristics, not all of them included in our definition. Still, we
   use this term to emphasize that these are agents strongly influencing
   other agents to adopt their strategy. In this context, leaders do not
   have the will to seek their leadership, nor do they try to preserve it.

2. Conformists. Unsatisfied cooperators, that is, cooperators that do not
   have maximum payoff in their neighborhood, but imitate other agents
   by playing their same strategy. They constitute the large majority of
   the nodes of the imitation network except for the leaders.

3. Exploiters. Defectors that take advantage of others’ actions. Defectors
   have a larger payoff than their C neighbors, showing they succeed in
   exploitation and are satisfied. In the imitation network, they do not
   have any directed link, because in the stationary state, no other agent
   imitates their strategy.
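
Given the strategies and payoffs of a stationary configuration, the three roles and the directed links of the imitation network can be read off mechanically. The following sketch is only an illustration of the definitions above. The function names are hypothetical, and treating a cooperator that ties with its best neighbor as satisfied is an assumption.

```python
def classify_roles(strategy, nbrs, pay):
    """Label each agent as 'leader', 'conformist', or 'exploiter'."""
    roles = {}
    for i in nbrs:
        if strategy[i] == "D":
            roles[i] = "exploiter"
        elif all(pay[i] >= pay[j] for j in nbrs[i]):
            roles[i] = "leader"       # satisfied cooperator (ties count as satisfied)
        else:
            roles[i] = "conformist"   # unsatisfied cooperator
    return roles


def imitation_links(strategy, nbrs, pay):
    """Directed links (imitator, imitated) for every unsatisfied cooperator."""
    roles = classify_roles(strategy, nbrs, pay)
    return sorted((i, max(sorted(nbrs[i]), key=pay.get))
                  for i in nbrs if roles[i] == "conformist")


if __name__ == "__main__":
    # Hypothetical stationary configuration with payoffs for b = 1.5.
    nbrs = {"j": {"i", "x1", "x2"}, "i": {"j", "d"},
            "x1": {"j"}, "x2": {"j"}, "d": {"i"}}
    strategy = {"j": "C", "i": "C", "x1": "C", "x2": "C", "d": "D"}
    pay = {"j": 3.0, "i": 1.0, "x1": 1.0, "x2": 1.0, "d": 1.5}
    print(classify_roles(strategy, nbrs, pay))
    print(imitation_links(strategy, nbrs, pay))
```

In this small example the imitation network is a single tree rooted at the leader j, and the exploiter d has no imitation link, as stated for stationary states.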


SIMULATING SOCIAL DYNAMICS 


To facilitate comparison with previous results of the PD game in fixed
networks (e.g., Nowak and May 1992), we have performed numerical
simulations of the above model using as main parameters the incentive
to defect b = T in the range 1 < b < 2 and the social plasticity p ∈ [0, 1].
The rest of the PD payoff matrix parameters have been kept fixed,
R = 1, P = 0, and S = 0, as in Nowak and May (1992). It has been
previously found (Lindgren and Nordahl 1994) that setting P = S = 0
does not change the main results, although the single-shot game then no
longer satisfies the strict PD conditions. We have made sure that changing
the parameter P in the region (0, 0.1) does not significantly affect our
results.

All simulations were performed with N = 10,000 agents. The strategy
initial condition was always set to 60% of randomly distributed
cooperators. The network initial condition was set by distributing KN/2
undirected links between random pairs of nodes. The number K corresponds
to the average connectivity per agent, and we considered K = 8. The
initial degree distribution (the degree being the number of links of each
agent) constructed in this manner is a Poisson distribution with parameter
K. Simulations with other parameter values have been performed, and no
qualitative differences were found. Likewise, we have ensured that using an
asynchronous updating in our simulations does not change any meaningful
qualitative result. For the above parameters, a stationary state is reached
in a time scale that depends very much on p, requiring about 1,000 time
steps for p = 0.01 and about 10 time steps for p = 1. For smaller N the
typical time to reach a stationary state decreases, while for larger K it
increases. A proper characterization of the general properties of these
stationary states relies then on averaging over many different numerical
experiments. Our results are given as averages over 100 different
experiments.
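
The initial conditions described above can be generated as follows. This is a hedged sketch rather than the original simulation code: the function names are hypothetical, self-links and duplicate links are rejected (an assumption), and the 60% of cooperators is imposed as an exact fraction rather than drawn agent by agent (also an assumption).

```python
import random


def random_network(n, k, rng):
    """Place k*n/2 undirected links between distinct random pairs of nodes."""
    nbrs = {i: set() for i in range(n)}
    links = 0
    while links < k * n // 2:
        i, j = rng.randrange(n), rng.randrange(n)
        if i != j and j not in nbrs[i]:  # assumed: no self-links, no duplicates
            nbrs[i].add(j)
            nbrs[j].add(i)
            links += 1
    return nbrs


def random_strategies(n, frac_c, rng):
    """Assign 'C' to a randomly chosen fraction frac_c of agents, 'D' to the rest."""
    cooperators = set(rng.sample(range(n), round(frac_c * n)))
    return {i: ("C" if i in cooperators else "D") for i in range(n)}


if __name__ == "__main__":
    rng = random.Random(2005)  # arbitrary seed, for reproducibility only
    net = random_network(10_000, 8, rng)
    strat = random_strategies(10_000, 0.6, rng)
    degrees = [len(v) for v in net.values()]
    print("mean degree:", sum(degrees) / len(degrees))
    print("fraction of cooperators:", sum(s == "C" for s in strat.values()) / len(strat))
```

For N much larger than K the degrees produced this way are approximately Poisson distributed with mean K, which matches the initial degree distribution described in the text.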

Simulations in our adaptive dynamic network game show how cooperation is
in general enhanced, and no threshold to an all-D network is observed. In
fact, our results indicate that for a given set of parameters, there is a
coexistence of a multitude of cooperative stationary states and the all-D
network state. This means that for a fraction of the initial conditions,
the stochastic dynamics of the network may lead either to the all-D network
or to a stationary state. Once in the all-D network, the system gets
trapped, and no recovery is possible (remember that in an all-D state the
interaction links change, but not the strategies). The probability of
reaching the trapping all-D network decreases as the number of agents
increases (it also decreases as the incentive to defect b decreases and the
initial fraction of cooperators increases), and thus the trap appears to be
a finite size effect. For instance, for an incentive to defect b = 1.6 and
plasticity p = 0.01, only 6% of the realizations get trapped for a large
population N = 10,000, while for N = 1,000, 80% get trapped. Here we
concentrate on stationary states where cooperators coexist with defectors.


Fraction of Cooperators 


Figure 3 shows a first global characterization of the stationary states for
different values of the parameter p and the incentive to defect b. The
fraction of cooperators, $f_C$, is measured for a fixed network (p = 0) and
is shown to decrease with b, approaching zero at a threshold value of
b ≈ 1.85. Thus, context preservation (Cohen et al. 2001) without social
plasticity provides partial cooperation. In clear contrast, for positive p,
social plasticity facilitates the establishment of a highly cooperative state
with a fraction of cooperators essentially independent of the incentive to
defect b. Even for small plasticity p, the fraction of cooperators is above
90% in the range of parameter b considered. A similar result is obtained
for other values of the average connectivity K > 2. In this sense, context
preservation is not a necessary condition to build up cooperation, but
rather the social structure is a consequence of an adaptive dynamics in
which cooperation is greatly enhanced.

FIG. 3. Average fraction of cooperative agents, $f_C$, for different values
of the plasticity p and incentive to defect, b. For a fixed network (p = 0),
$f_C$ decreases with b. However, in the presence of social plasticity
(p ≠ 0), $f_C$ is kept above 90%.

It is worth mentioning that the result of enhanced cooperation works
against the direct effects of the neighborhood adaptation rule, introduced
to facilitate the survival of defectors. Defectors are allowed to find new
cooperators to exploit, exiting from unsatisfactory interactions with other
defectors. However, the final outcome is that the number of defectors
decreases. It seems that successful defectors become isolated because they
are role models: their victims “run away.” The new defectors, those that
imitated successful defectors, establish links with cooperators that have
a high concentration of other cooperators in their neighborhoods. Thus,
the formerly exploited cooperators, now defectors, turn again into
cooperators. To achieve cooperation, it seems that cooperators need enough
cooperator partners to have a payoff higher than any defector. This
hypothesis is analyzed in the following sections.


Distribution of Cooperators’ and Defectors’ Payoffs 


The total average payoff is larger in the dynamic network (p ≠ 0) than
for a frozen network (p = 0). However, we have measured systematic
differences when considering the average payoffs $\Pi_D$ and $\Pi_C$ for
each of the respective subpopulations D and C. While the number of
cooperators is larger, their average payoff is smaller than that of the
defectors, reversing the situation of a frozen network (see fig. 4). The
payoff distribution in a stationary state for each subpopulation is shown in
the inset of figure 4 for a particular value of b. This graph shows that the
most probable payoff is larger for defectors, also explaining the larger
average payoff of the defectors. However, close inspection reveals that the
distribution for cooperators always has a longer tail, indicating that a
number of cooperators have a payoff larger than that of any defector in the
network. Most of these agents are leaders and are necessary for a
cooperative stationary state to exist.

Thus, the main characteristic of our adaptive social game is that an
altruistic social network, that is, a network composed of a large number
of cooperators, may develop with a few defectors that have a large average
payoff. The neighborhoods of the agents adapt to conform to the
requirements of the stationary states (D agents only interact with
cooperators), and the cooperator with the largest payoff in each branch of
the imitation network corresponds to a “leader” with a payoff much larger
than the average, whose strategy all other agents imitate.


Transient Dynamics, Leaders, and Social Crisis 


The evolution toward a stationary state is typically not a smooth, monotonic
buildup of a globally cooperating state. Starting from a random initial
condition, the transient dynamics are characterized by small fluctuations
and distinctive, large oscillations in the proportion of cooperators as a
function of time that can be understood in terms of social crisis. An example
of a frustrated attempt to build cooperation is the evolution of the
proportion of cooperators shown in figure 5. Starting from a nonstationary
state of low cooperation in the initial fixed random network for t < 200,
network dynamics then lead to a social network with a high degree of
cooperation after several large oscillations in the time interval
220 < t < 300. The large oscillations in $f_C$ are frustrated attempts to
build cooperation. They indicate that the defecting behavior is so rewarding
that cooperation has to find a specific network configuration in order to be
robust against eventual changes of strategy. In such a configuration, the
most connected agent (the one that has the largest number of links) in the
imitation network is also the one with the largest payoff. In the frustrated
attempts to reach a global stationary cooperative state, the proportion of
cooperators becomes large. A few time steps before $f_C$ reaches each
maximum, a cooperator receives a new link from a defector with a larger
payoff and switches to the D strategy, starting a social crisis. Given that
at those time steps there is a large proportion of cooperators in the
network, an avalanche of D-strategy imitation starts. This avalanche will
certainly affect the neighbors of a cooperator leader, thus decreasing the
payoff of most leaders. Precisely when the avalanche begins, there is a
defector with a payoff larger than that of the absolute cooperative leader.
The social crisis propagates through most of the system, producing a change
in the connectivity of the social network (Zimmermann et al. 2001). The
initial distributions of links for the C and D populations are Poisson
distributions around the average connectivity, typical of random networks.
At a local maximum of $f_C$, the two distributions have exponential tails
for large numbers of links. Then, very rapidly, the network switches to the
almost defective solution with a large number of D-D links and Poissonian
degree distributions. However, the existence of a small number of
cooperators with a large payoff permits, thanks to the plasticity of the
network, the gradual buildup of cooperation by creating C-C links. This
requires two steps. First, D-C links are created (initiated by the
defectors), and then the defectors imitate the more successful cooperator in
a later stage. Finally, in the stationary state, the distribution of links
for the D population becomes very narrow, while the distribution for the C
population displays a tail approaching an exponential decay. The stationary
network configuration is thus dominated by a few cooperators, the leaders,
with a large number of links (the tails of the distribution of links for
cooperators). These highly connected agents dominate the collective behavior
of the network.

Fig. 4. Difference between the average payoff of defectors,
$\langle \Pi_D \rangle$, and cooperators, $\langle \Pi_C \rangle$, as a
function of b for p = 0 (stars) and p = 0.10 (squares). The effect of the
social plasticity is that the defectors get a larger payoff on average:
$\langle \Pi_D \rangle - \langle \Pi_C \rangle$ is positive for $p \neq 0$.
Inset: probability distribution (normalized to each subpopulation) of
individual payoff, for cooperators (solid line) and defectors (dotted line).
The most probable payoff, which corresponds to the maximum of the
distribution, is larger for defectors than for cooperators. However, there
are a few cooperators getting the largest payoff in the population, as is
seen in the tails of the distribution for large values of $\Pi$. Parameter
values: b = 1.75, p = 0.1.

Fig. 5. Time series for the fraction of cooperators $f_C$ for b = 1.7. The
evolution is … network dynamics are switched on, so that p = 1 for t ≥ 200.
At time t = 500 the network … how cooperation may recover after a social
crisis. After an oscillatory transient, a new stationary … configurations of
actions are shown at the indicated times. A white (black) point is a C (D) …
spatial location of the agents does not have any meaning because of the
building of a net…

These characteristic dynamics reveal the functional property of C leaders:
on the one hand, through the imitation process, they sustain cooperation,
while on the other, they enhance cooperation whenever the C leader has the
largest payoff of the whole network. In fact, there is a sort of competition
between the leader and the defector with the largest payoff. We remark that
the latter process is due to social plasticity: whenever a defector selects
a leader for partnership, there is a high probability that the leader has a
larger payoff and the defector will imitate cooperation, increasing the
number of cooperators.

Figure 5 also illustrates the sensitivity of the stationary network
structure to exogenous perturbations acting on the leaders, which reflects
their key role in sustaining cooperation. At time t = 500, the system has
already reached a stationary state. However, at this time step, we have
forced a strategy switch of the cooperative absolute leader, leading once
again to a social crisis similar to the one previously described in the
endogenous dynamics. In summary, the system self-organizes into one of
several possible cooperative states where avalanches and social crises are
likely to occur following spontaneous, focused local perturbations.


Structure of the Social Network 


A first characterization of the topology of the stationary network reached
by the dynamics is given by the degree distribution, that is, the
probability distribution of the number K of links of a node. This
distribution has a long tail that distinguishes it from the Poisson
distribution of the random network. This is reflected in the values of the
normalized standard deviation $\sigma_n = \sigma / K$ (with K in the ratio
denoting the average number of links) shown in figure 6 for different values
of p and b. We recall that for a Poisson distribution,
$\sigma_n = K^{-1/2}$, while for an exponential distribution $\sigma_n = 1$.
We find that for small values of b, the distribution departs significantly
from the Poisson distribution only for large values of the plasticity
parameter p, while for increasing b, the tail of the distribution expands
and approaches an exponential form. In other words, the hierarchical
structure of the network is accentuated as b increases, with fewer leaders
that have a larger payoff.
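
The two reference values quoted for the normalized standard deviation can be checked numerically. The Python sketch below is only an illustration under stated assumptions: the function names, the sample size, the fixed seed, and the use of a continuous exponential as a stand-in for an exponential degree distribution are all choices made here, not part of the model described in the text.

```python
import math
import random


def normalized_std(degrees):
    """Return sigma_n = sigma / <K> for a sequence of node degrees."""
    n = len(degrees)
    mean = sum(degrees) / n
    variance = sum((k - mean) ** 2 for k in degrees) / n
    return math.sqrt(variance) / mean


def poisson_sample(mean, rng):
    """Draw one Poisson variate with Knuth's multiplication method."""
    threshold = math.exp(-mean)
    count, product = 0, rng.random()
    while product > threshold:
        count += 1
        product *= rng.random()
    return count


rng = random.Random(0)  # fixed seed so the sketch is repeatable
K = 8                   # average degree, the value used in the fig. 6 caption
poisson_degrees = [poisson_sample(K, rng) for _ in range(20000)]
# Assumption: a continuous exponential stands in for an exponential
# degree distribution with the same mean.
exponential_degrees = [rng.expovariate(1.0 / K) for _ in range(20000)]

print(normalized_std(poisson_degrees))      # close to K ** -0.5
print(normalized_std(exponential_degrees))  # close to 1
```

Values of the normalized standard deviation between these two limits then indicate how far a simulated degree distribution has moved from the random-network case toward an exponential tail.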

The distribution of payoffs (inset of fig. 4) is similar to the degree
distribution because the payoff of the cooperators is given by the number of
links with other cooperators, and there is a very small number of defectors
in the stationary state. Therefore, the variations of $\sigma_n$ also provide
a measure of social inequality in the network. In fact, $\sigma_n$ is
closely related to the Gini coefficient (Kakwani 1980) used in the
characterization of economic inequalities in a social group. We have
measured the Gini coefficient of the resulting network configuration in a
stationary state and found that it can be twice as large as that of a random
(Poissonian distribution) network. Social plasticity generates a flux of
payoff toward richer individuals.
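
For readers who want to reproduce this kind of inequality measurement, the sketch below computes a Gini coefficient from a list of payoffs. It uses the standard rank-weighted form of the mean absolute difference definition, which is an assumption here: the text does not state which estimator was used. The function name and the example payoff lists are hypothetical.

```python
def gini(values):
    """Gini coefficient of non-negative values (0 means perfect equality)."""
    xs = sorted(values)
    n = len(xs)
    total = sum(xs)
    if n == 0 or total == 0:
        return 0.0
    # Rank-weighted form, equivalent to the mean absolute difference
    # between all pairs divided by twice the mean.
    weighted = sum(rank * x for rank, x in enumerate(xs, start=1))
    return 2.0 * weighted / (n * total) - (n + 1.0) / n


# Hypothetical payoff lists, for illustration only.
equal_payoffs = [5, 5, 5, 5]
hierarchical_payoffs = [1, 1, 1, 1, 2, 2, 3, 25]  # one "leader" holds most payoff

print(gini(equal_payoffs))
print(gini(hierarchical_payoffs))
```

Applying the same function to the payoffs of a random network and of a stationary adaptive network would give the kind of comparison described in the text.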

Finally, we address the question of whether the social structure generated
in our dynamical model has the characteristics of a small world network. Two
requirements have to be fulfilled: the clustering (or cliquishness) has to
be much larger than in a random network, and the average path length between
two nodes should be similar to that of a corresponding random network. The
clustering coefficient, c, measures the fraction of neighbors of a node that
are connected among themselves, averaged over all the nodes in the network.
Most very complex networks show a clustering larger than that of random
networks, which is given by $c_{rand} = K/N$ (Amaral et al. 2000). In our
original formulation, numerical simulations show (fig. 7) that for
increasing b the clustering coefficient increases very slightly with respect
to the clustering of a fixed random network (i.e., $c/c_{rand}$ may change
up to 106). We have tested a slight enhancement of the network adaptation
which easily accounts for high clustering. Very often, new acquaintances are
made based on the relationships of current neighbors. To implement this
idea, the social neighborhood adaptation step of our original formulation
was augmented: if a neighbor is replaced by a new one, with probability q a
local selection of a new partner happens among the neighbors of the
neighbors, while with probability 1 - q the previous random selection is
performed. The limiting case q = 0 corresponds to our original formulation,
while q = 1 corresponds to the case in which all the new partners are chosen
from the neighbors’ neighbors. It is natural that this new mechanism will
increase the clustering. Numerical simulations show that while most of our
previously discussed results are qualitatively independent of the value of
q, even a very small value of q makes the clustering coefficient very large.
For instance, 1% of local partner selection is enough to increase c 100
times, with the clustering being largest for a slow evolution of the network
(p < 1). As for the second requirement for small world topology, we find
that the average path length remains in all cases very close to that of a
random network. Together, our results indicate that, allowing for local
partner selection, the social network generated in our adaptive dynamics has
the structure of a small world network.

Fig. 6. Normalized standard deviation $\sigma_n$ of the degree distribution
of the social networks obtained for different values of the plasticity
parameter p and the incentive to defect b. For a Poisson distribution and
K = 8, $\sigma_n = K^{-0.5} = 0.35$, while for an exponential distribution,
$\sigma_n = 1$. For p = 0, $\sigma_n$ takes the value of a Poisson
distribution. As p and b are increased, the interaction network departs from
a Poisson distribution and approaches a distribution with an exponential
tail.

Fig. 7. Normalized clustering coefficient $c/c_{rand}$, with
$c_{rand} = K/N$. Filled circles correspond to p = 1 and q = 0. Other curves
are for q = 0.01 and p = 0.01 (diamonds), p = 0.1 (squares), and p = 1
(empty circles). Social plasticity ($p \neq 0$) and a local selection of new
partners ($q \neq 0$) are needed in order to reproduce the clustering
observed in social networks.
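
The local partner selection with probability q, and the clustering coefficient used to evaluate it, can be sketched in Python as follows. This is a minimal illustration, not the authors' code: the function names, the undirected adjacency-set representation, the fallback to random selection when no neighbor of a neighbor is available, and the convention that nodes with fewer than two neighbors contribute zero clustering are all assumptions made here.

```python
import random
from itertools import combinations


def choose_new_partner(agent, neighbors, q, rng):
    """Pick a replacement partner for `agent`.

    With probability q the choice is local (a neighbor of a neighbor);
    otherwise it is uniform over all agents not yet linked to `agent`.
    """
    current = neighbors[agent]
    outsiders = [a for a in neighbors if a != agent and a not in current]
    if not outsiders:
        return None  # assumption: nothing to do if already linked to everyone
    if rng.random() < q:
        local = sorted({nn for nb in current for nn in neighbors[nb]
                        if nn != agent and nn not in current})
        if local:
            return rng.choice(local)
        # assumption: fall back to random selection if no local candidate
    return rng.choice(outsiders)


def clustering_coefficient(neighbors):
    """Average fraction of a node's neighbor pairs that are themselves linked."""
    total = 0.0
    for node, nbrs in neighbors.items():
        if len(nbrs) < 2:
            continue  # assumption: such nodes contribute zero
        pairs = list(combinations(sorted(nbrs), 2))
        linked = sum(1 for a, b in pairs if b in neighbors[a])
        total += linked / len(pairs)
    return total / len(neighbors)


# Small undirected example: a path 0-1-2-3-4.
graph = {0: {1}, 1: {0, 2}, 2: {1, 3}, 3: {2, 4}, 4: {3}}
rng = random.Random(1)
print(choose_new_partner(0, graph, q=1.0, rng=rng))  # local choice: agent 2
print(clustering_coefficient(graph))                 # a path has no triangles
```

Linking agent 0 to the locally chosen agent 2 would close the triangle 0-1-2, which is why even a small q is expected to raise the clustering coefficient relative to purely random partner selection.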
DISCUSSION


We have presented a minimal model that incorporates simultaneous and coupled
evolution (coevolution) of the strategies of the agents and of the social
network, providing a first step in the investigation of the processes of
social differentiation in a globally cooperating social group. Different
agents play different social roles that are acquired through social
interaction and are not externally imposed or determined by genetic
mechanisms. Rather, the roles emerge from the self-organizing dynamics of
the complex system. It is important to note that the initial differences in
number of links among the agents do not determine the final roles of each
agent, since each agent temporarily changes its role in the stochastic
dynamics until a final stationary state is reached. Moreover, we have
verified, starting from the initial condition of a random network in which
each agent has the same number of links, that the system dynamics lead to
the same emergent role differentiation.

Our results indicate that cooperation is stabilized by a hierarchical
self-organized structure, so that the formation of cohesive clusters of
cooperators with the exclusion of free riders is unnecessary for persistent
global cooperation. Allowing for local partner selection, this
self-organization leads from a random network to a final social network with
the topology of a small world network, demonstrating how small world
connectivity, in which clustering is larger than in a random network, can be
dynamically achieved.

Our study reveals that if defectors have the ability to choose partners,
breaking interactions with other defectors, the number of cooperators
increases, but the average payoff of cooperators is less than that of greedy
agents. The dynamics naturally generate leaders (individuals getting a large
payoff that are imitated by a considerable fraction of the population),
conformists (unsatisfied cooperative agents that keep cooperating), and
exploiters (defectors with a payoff larger than the average obtained by
cooperators). The most prominent role is that of the leader, a cooperator
that not only sustains cooperation, but also drives the whole system toward
more cooperation. Defectors are found to remain stable whenever they exploit
cooperators. The formation of a social hierarchy in the population is the
source of possible unstable behavior. It promotes the occurrence of social
crises that can affect a large fraction of the population. These crises take
the form of global cascades that might be easily triggered by the
spontaneous change in action of a highly connected agent. This result
highlights the importance of highly connected agents that, as illustrated by
the imitation network, play a leadership role in the collective dynamics of
the system. Such sensitivity of the network stability to local special
perturbations reveals an interesting feature of globalization: the group of
agents organizes itself into a state where an exogenous or stochastic
perturbation may produce drastic changes, at a distance, within a limited
time period.

Even though our model has many interesting features, the strategy and
neighborhood update rules represent an extreme and conservative choice of
rules. Thus, it would be important to investigate how the modification of
the model rules affects the results reported here. Some of the points that
may merit further research in connection with our strategy update and
network update rules are outlined in the remainder of this article.

Starting with our strategy update rule, we note that an important assumption
made in our model is that the satisfaction of an agent is determined by
comparison of absolute payoffs. Taking into account the evolution of the
social interaction, this assumption implies that two neighbors might have
different payoffs because they have a different number of neighbors. On the
one hand, this assumption highlights the importance of being highly
connected. On the other hand, basing a comparison on the relative payoff per
neighbor implies that each agent knows how many neighbors each of its
neighbors has. Additionally, our strategy update rule is an extreme “copy
best” rule, while alternatives like probabilistic imitation of better role
models (Schlag and Pollock 1999) or probabilistic selection of neighbors’
strategy (Nowak and May 1993) could also be considered.

For the network update rule, we note that, in our model, only links between
two defectors can be broken. In a stationary state, most cooperators are
unsatisfied because they imitate an agent with a larger payoff. In our
model, this unsatisfactory situation does not induce action to improve
payoff beyond the imitation of a neighbor in the social network. One way to
test the implications of this hypothesis would be to modify the network
update rule, letting the C-D link also be broken. For example, we could let
cooperators have some probability of exchanging a D neighbor for a random
agent. In this setting, cooperation would be further enhanced, and the
occurrence of large social crises could be limited. Thus, our rule for
neighborhood update assures in particular that the highly successful
defectors will not be abandoned by the cooperators whose exploitation has
made them so successful. Hence, we make defection an attractive option in
that exploited agents cannot simply leave their exploiter because of, for
example, changed costs of relationship or some (semi-)rational calculation
of the expected benefit of creating new ties. A related point is that one
could question why D-D links cannot be broken except for those that involve
an unsatisfied agent. We have in fact assumed that only agents having at
least one neighbor with higher payoff are “forced” to do something to
improve their payoff. Thus, we assume some kind of “aversion to change”: one
will do something only when one is not satisfied; otherwise one will do
nothing. This rule can be justified by the cost implicit in the change of a
social relation, and it allows, in principle, the maintenance of interaction
between two defectors as long as they are both satisfied.

There is another point that deserves further investigation and that relates
both to the strategy and network update rules: our choice of strategy update
is based on the aggregate payoff, while our justification of the network
adaptation rule is based on comparison of individual actions. This a priori
assumption involves two different mechanisms, and it is clear that our
choice is one among several possibilities. We have already mentioned a
strategy update (Nowak and May 1993) based on probabilistic selection among
the neighbors’ strategies weighted by the aggregate payoff. Cohen et al.
(2001) compare different such mechanisms on fixed networks. On the other
hand, fewer results are known about network update mechanisms. In the
economics literature, a common rule is that a link between two economic
agents is accepted if both agents improve their payoff (Jackson and Watts
2002; Bala and Goyal 2000). It would be interesting to test new strategy and
network adaptations involving a more sophisticated evaluation of the
strategy of the opponent and its aggregate payoff. One possible way would be
to contemplate a more complex strategy space, involving strategies dependent
on past encounters. Work in this direction with a fixed social network was
initiated by Lindgren and Nordahl (1994). Another interesting variable to
consider is nonequivalent agents that may differ in their attractiveness as
exchange partners. Flache (2001) has incorporated this situation in a model
that combines the agents’ decisions about cooperation with the decisions
about selection of new partners. This also resulted in a process of social
differentiation sustaining cooperation. It is unclear how much of the
process of social differentiation originates in the unequally attractive
agents or in the simpler mechanisms contained in our model.

Beyond our update rules, the addition of random perturbations in strategy
and network is also a very relevant feature to be explored in the future.
These perturbations may originate from errors in imitation or payoff
determination, for example. Based on our results, we can see that strategy
perturbations may cause large social crises, especially if a well-connected
agent makes an error.

We finally remark that many situations in human societies follow the dynamic
scheme considered here: new collaborations are frequently formed, while
other long-lasting partnerships die out. From scientific collaboration to
sports teams, not to mention political parties, our study offers an example
of a simple mechanism by which leadership and other social roles might
appear and consolidate.
The Emperor’s Dilemma: A Computational
Model of Self-Enforcing Norms

Cornell University


The authors demonstrate the uses of agent-based computational models in an
application to a social enigma they call the “emperor’s dilemma,” based on
the Hans Christian Andersen fable. In this model, agents must decide whether
to comply with and enforce a norm that is supported by a few fanatics and
opposed by the vast majority. They find that cascades of self-reinforcing
support for a highly unpopular norm cannot occur in a fully connected social
network. However, if agents’ horizons are limited to immediate neighbors,
highly unpopular norms can emerge locally and then spread. One might expect
these cascades to be more likely as the number of “true believers” increases
and bridge ties are created between otherwise distant actors. Surprisingly,
the authors observed quite the opposite effects.


Naturally, the best proof of the sincerity of your confession was
your naming others whom you had seen in the Devil’s company.
—Arthur Miller, 1996 


THE POPULAR ENFORCEMENT OF UNPOPULAR NORMS 


In “The Emperor’s New Clothes,” Hans Christian Andersen ([1837] 1998) tells
the story of three rogues who sell a foolish monarch a nonexistent robe that
they claim cannot be seen by those who are “unfit for office” or
“incorrigibly stupid.” Fear of exposure leads the emperor, and in turn, each
of the citizens, to express admiration for the new clothes, which then
reinforces the illusion of widespread support for the norm. The spell is
broken when a child, innocent of the norm, laughs at the naked old man.

It is not hard to find everyday examples of this fable in the academic
kingdom. We can all think of prestigious scholars who are widely proclaimed
as having the most brilliant new ideas, yet privately, people find the work
entirely incomprehensible. Some may worry that perhaps they are indeed
inadequate, that those who cannot see these beautiful ideas must be
“incorrigibly stupid.” Others are quite certain that the emperor is naked
but worry about being dismissed as an intellectual lightweight by
enthusiasts who clearly seem to understand and appreciate every word. The
safest course is to go along with the charade and admire the emperor,
thereby reinforcing this same false belief among our colleagues.

The problem is not limited to faculty. Studies of campus attitudes toward
drinking find that students anticipate negative social consequences for
failing to participate in drinking rituals that celebrate intoxication as a
symbol of group identity, especially in fraternities (Nagoshi et al. 1994;
Perkins and Wechsler 1996; Baer 1994; for a review, see Borsari and Carey
[2001]). Yet Prentice and Miller (1993) found that students were privately
less comfortable with alcohol use than they (falsely) perceived other
students to be. The study suggests that, contrary to campus legend, students
are actually somewhat uncomfortable about excessive drinking, at least when
they are sober.

According to Prentice and Miller (1993), students in their college drinking
study are victims of “pluralistic ignorance,” a term first coined by Allport
(Katz and Allport 1931, p. 152). Pluralistic ignorance describes situations
where a majority of group members privately reject a norm, but assume
(incorrectly) that most others accept it (see Miller and McFarland [1991]
and O’Gorman [1986] for reviews). It is, in Krech and Crutchfield’s (1948,
pp. 388-89) words, the situation where “no one believes, but everyone thinks
that everyone believes.” The illusion of support is validated when it
motivates widespread public compliance.

Pluralistic ignorance has been documented not only among groups that indulge
but also among those that abstain. For example, in Schank’s (1932) classic
investigation, the members of a religious community were observed publicly
endorsing norms against gambling, smoking, and drinking that they violated
in private. More recently, Kitts (2003) found that students in five
vegetarian housing cooperatives overestimated public support for dietary
norms that were publicly enforced but privately violated. Kitts tested
relational explanations (“selective exposure” and “selective disclosure”)
against social psychological theories of cognitive bias. Consistent with
theories of pluralistic ignorance, he found greater support for the
relational effects of differential access to information about others’
compliance.

Other examples are more disturbing. O’Gorman found that American whites
grossly exaggerated other whites’ support for segregation in the late 1960s
(O’Gorman 1975; O’Gorman and Garry 1976). A similar pattern can be found in
other repressive regimes. In his book Private Truths, Public Lies, Timur
Kuran (19952) points to widespread but illusory support for the communist
regime in the former Soviet Union, based in part on fear of denunciation for
revealing private opposition to neighbors whose apparent enthusiasm for the
regime was in fact equally a charade, and for the same reason.²

A similar dynamic is evident in witch hunts. As noted by Erikson (1966),
witch hunts are caused not by an outbreak of deviance, but by an outbreak of
enforcement. Witches are created by anxious neighbors seeking to affirm
their status in the community by accusing others of deviance, thereby
perpetuating the fear that fuels the need for affirmation. Those accused can
then save themselves only by revealing the names of yet other neighbors.
Perhaps no one in this population actually believes in the existence of
witches. Yet a terrified public turns out to cheer at the executions, in
public expiations of a collective anxiety that is of their own making. This
self-reinforcing dynamic indeed casts a spell on the community as powerful
as that of any witch.

We need not assume this dynamic is some historical relic of superstition.
Witch hunts were highly publicized on both sides in the early years of the
Cold War. Contemporary witches may also include gays assaulted by young
thugs eager to affirm their manhood. A study by Adams, Wright, and Lohr
(1996) found that homophobic men rated themselves as having lower levels of
arousal than other men when shown videos of homosexual intercourse. However,
physiological measures of sexual response were found at higher levels among
the homophobic men. The results suggest that aggressive same-sex enforcement
of heterosexual norms may be motivated by anxiety over the transparency of
hidden deviation. Research on adolescent gangs (Willis 1977; MacLeod 1995)
shows how homophobic humor is used to ridicule group members who lack the
requisite toughness and to affirm the status and loyalty of those who might
otherwise become suspect themselves.

The willingness to feign support for a public lie has also been demonstrated
under laboratory conditions. In a classic study, Asch (1951) showed that
participants would conform to a consensus judgment they knew to be false
rather than risk social isolation as a deviant. When participants were
assured anonymity, the false compliance disappeared.

2 Kuran (1991, 1995a; see also Bicchieri and Fukui 1999) also highlights the
potential for rapid collapse, triggered when a few vanguards finally express
their actual belief, which encourages others to overcome social pressure and
express their actual belief, and so on.

It is not difficult to find other familiar examples of compliance with, and
enforcement of, privately unpopular norms:

1. the exposure of the “politically incorrect” by the righteously indignant
who thereby affirm their own moral integrity;

2. gossiping about a social faux pas by snobs anxious to affirm their own
cultural sophistication;

3. public adoration of a bully by fearful schoolboys who do not want to
become the next victim;

4. “luxury fever” (Frank 2000) among status seekers who purchase $50 cigars,
$17,000 wristwatches, and $3 million bras, in an arms race of conspicuous
consumption and one-upmanship that leaves the contestants no happier but
perhaps a bit less affluent.


Naked emperors are easy to find but hard to explain. It is easy to explain
why people comply with unpopular norms: they fear social sanctions. And it
is easy to explain why people pressure others to behave the way they want
them to behave. But why pressure others to do the opposite? Why would people
publicly enforce a norm that they secretly wish would go away?

One hypothesis is that very few would actually enforce the norm, but no one
knows this. If people estimate the willingness to enforce based on the
willingness to comply, and they comply based on the false belief that others
will enforce, they become trapped in pluralistic ignorance, an equilibrium
in which few people would actually enforce the norm but no one realizes
this. However, this equilibrium can be extremely fragile. As in the Andersen
story, all that is needed is a single child to laugh at the emperor and the
spell will be broken.


THE ILLUSION OF SINCERITY 


A more robust explanation is that most people really will enforce the norm,
and for the same reason that they comply: social pressure from others in the
group, for whom mere compliance is not enough.³ To the true believer, it is
not sufficient that others go to the right art galleries, display the right
body jewelry, purchase the right sports car, or support the right wing. They
must do it for the right reason. Zealots believe that it is better not to
comply at all than to do so simply to affirm social status (Kuran 1995a,
p. 62). Such compliance lasts only so long as behavior can be monitored and
social pressure is sufficient to induce acquiescence (Hechter 1987). Thus,
true believers reserve special contempt for imposters. Those who comply for
the wrong reason must worry about being exposed as counterfeit.

3 Norms mandating the enforcement of other norms are sometimes referred to
as “metanorms” (Axelrod 1986; Horne 2001). Explicit obligations to enforce
can be found in systems of collective sanctioning in which a group is made
responsible for the compliance of its members. A good example is an honor
society that obligates individuals to punish family members or fellow
students who dishonor the group by cheating on a spouse or an exam (Vandello
and Cohen 2004). Classmates who fail to report a student who cheated on an
exam are also guilty, as are those who fail to report those who tolerated
cheating. In “cultures of honor” (Nisbett and Cohen 1996), a daughter’s
sexual transgression dishonors the entire family until someone (e.g., the
father or oldest brother) restores the family’s social position by carrying
out the obligatory “honor killing.” Honor codes also obligate punishment of
out-group members who violate strict rules regarding displays of respect
(Bourdieu 1966; Elster 1990). Failure to carry out the vendetta leads to
being labeled a coward and can result in further attacks on one’s family
(Gould 2000). Those who befriend a “coward,” in turn, come to be tarnished
with the same brush. Simply put, every violation of the code of honor
becomes an acid test of everyone’s loyalty to and standing within the
community.

The hypothesized anxiety is supported by research on the “illusion of
transparency” (Gilovich, Savitsky, and Medvec 1998). This refers to a
tendency to overestimate the ability of others to monitor our internal
states. Savitsky, Epley, and Gilovich (2001) found that individuals tend to
overestimate how harshly others will judge them for a public mishap. Across
four experimental studies, actors anticipated being more harshly evaluated
than was actually the case.

Applied to the emperor’s dilemma, the “illusion of transparency” suggests
that those who admire the emperor out of a desire for social approval fear
that their posturing will be apparent to others. They then look for some way
to confirm their sincerity. Enforcing the norm provides a low-cost way to
fake sincerity, to signal that one complies not as an opportunist seeking
approval but as a true believer.

What better way to signal one’s sincerity than to act in a way that
encourages others to comply (Kuran 19955, p. 61)? When their moral,
political, or professional “fitness for office” is challenged, people rarely
turn the tables on their inquisitors. If conformity is sanctioned, while
enforcement is not, conformists may be suspected of posturing in order to
gain social approval, but those who enforce conformity appear to be the
genuine article. This use of enforcement to signal sincerity explains the
apparent fanaticism of “new recruits” who must prove their loyalty to the
established members of a cult or gang, and it also raises the possibility
that the thought police may actually be imposters themselves, a defensive
tactic Freud (1894) called “reaction formation” (Baumeister, Dale, and
Sommer 1998).⁴

In the college drinking example, insecure freshmen who worry about
social acceptance may be tempted to drink, and to celebrate intoxication,
in order to appear “cool.” However, they must not appear to be
motivated by this goal or they risk being scorned as a “poser.” Thus, it
is not enough to “party”: they must also express the belief that drinking
is cool and act accordingly, thereby adding to the social pressure that
leads others like them to join in.

Or consider those who pretend to appreciate some highly opaque
scholar in order to affirm their erudition.⁵ Privately, they have no clue
what the writings mean (if anything), and they worry that true believers
will see them as fakes (the illusion of transparency). The solution is easy:
simply disparage those intellectually shallow scholars who fail to
appreciate real genius. But when one does this, one adds yet another voice to
the chorus of intimidation that induces the insecurity motivating the
behavior in the first place. The norm becomes self-enforcing.

Can self-enforcing norms emerge in a reluctant population, in the
absence of any top-down institutional repression, or does it require a
powerful emperor to jump-start the process? Can unpopular enforcement be
entirely self-organizing? How many true believers are required to trigger
a cascade that pulls in the disbelievers as well? Do these cascades depend
on the structure of social networks? How stable is an unpopular
equilibrium, and how large a disturbance is required for it to collapse?

One way to find the answers to these questions is to systematically
study norm enforcement in communities suspected of being trapped in
an emperor’s dilemma. Although it is not difficult to find empirical case
studies, it may be nearly impossible to find convincing evidence that the
supporters of the norm are actually imposters. It is not clear that people
even really know their own beliefs (Nisbett and Wilson 1977), given the
power of self-deception and the subconscious state of reaction formation,
but even if we assume that people have perfect interior knowledge of
mental states, they have no direct access to the mental states of others.
If people are unable to distinguish true believers and imposters, this may
also be true for social scientists, including participant observers.

4 Based on their review of empirical work on reaction formation, Baumeister et al.
(1998) concluded that “people respond to the implication that they have some
unacceptable trait by behaving in a way that would show them to have the opposite trait”
(1998, p. 1,085).

5 This anxiety is supported by research on “the imposter phenomenon” and refers to
the feeling, common among academics and professionals, that one is an “intellectual
fraud” (Clance 1985).


These difficulties have led researchers to study unpopular norm
compliance through formal methods, such as game theory. It can be shown,
for example, that if an entire population is enforcing a norm that compels
intolerance of deviance by oneself and others, then enforcement of the
norm is a Nash equilibrium. Even if everyone prefers that the norm would
disappear, no one has an incentive to change strategy unilaterally,
thereby becoming the lone deviant in a population of enforcers
(Heckathorn 1990, Binmore 1998).

However, knowing that an equilibrium exists does not mean that this
outcome is likely or even attainable. A growing interest in
out-of-equilibrium dynamics has led game theorists to use evolutionary models to
study the emergence and stability of equilibria from a variety of initial
conditions (Skyrms 1996, Young 1998, Gintis 2000). However, these
analytic models are often mathematically tractable only if populations are
assumed to be fully connected or randomly connected. Yet research on
cascades points to the decisive importance of the structure of local
interactions (Watts 2002, Centola, Macy, and Eguíluz 2004).

Given these difficulties with traditional methods, computational
modeling can be a useful approach for studying normative cascades. An
empiricist might object that computer simulation is just as much a fairy tale
as Andersen’s story. We do not dispute that, but we would point out that
fairy tales populated by computational agents are capable of attaining a
much higher level of logical consistency than those populated by
characters expressed in natural language.

On the other side, analytical game theorists criticize computational
models as numerical rather than mathematical, but as Abbott (1998, pp.
176-77) has noted, “[analytical] game theory will not get us very far
because it is ignorant, except in the most general terms, of a serious
concern with structure and with complex temporal effects. But simulation
may help us understand the limits and possibilities of certain kinds of
interactional fields, and that would be profoundly sociological
knowledge.”

Multiagent models can be useful for studying processes such as informal
social control that lack centralized coordination. These models focus on
how simple and predictable local interactions generate familiar but often
enigmatic global patterns, such as cascading enforcement of unpopular
norms. By looking for ways to generate these cascades under controlled
conditions in a population of computer agents, we may find some clues
about the dynamics to look for when we try to model the natural world
empirically.


UNPOPULAR NORMS: AN AGENT-BASED COMPUTATIONAL MODEL


We model a heterogeneous population of agents who differ in their beliefs
and convictions. Each agent i has a binary private belief, B_i, which defines
the agent as either a “true believer” (B_i = 1) or a disbeliever (B_i = -1).
A small group of true believers (analogous to Andersen’s three rogues)
have such strong convictions that they always comply with the norm,
regardless of social pressure not to comply. When dissatisfied with the
level of compliance by others, they may also enforce the norm. We call
this “true enforcement” because the agent is enforcing compliance with
its true (privately held) belief.

The remainder of the population (analogous to Andersen’s citizens)
consists of skeptics (“disbelievers”) who privately oppose the norm, but
with less conviction compared to that of the true believers. This opposition
can lead them to deviate from the norm and even to pressure others to
deviate as well. This is also “true enforcement” because the disbeliever is
enforcing its true belief, which happens to be in opposition to the norm.
However, because their convictions are not as strong as those of true
believers, disbelievers can also be pressured to support the norm publicly.
This support includes not only compliance with the norm but can also
include pressuring others to comply as well. We call this “false
enforcement” because the agent is enforcing a behavior that does not conform
with the agent’s private beliefs. Thus, there can be three enforcement
possibilities:


1. true enforcement by true believers (who truly support the norm),

2. true enforcement by true disbelievers (who truly oppose the norm),

3. false enforcement by false disbelievers (who privately oppose but
publicly support the norm).⁶


At each iteration, each agent observes how many of its neighbors
comply with the norm and how many deviate. Agents also observe how many
neighbors are pressuring others to comply and how many are pressuring
others to deviate. These numbers are used to ascertain the level of public
support for and opposition to the norm. These distributions, in turn,
influence the two decisions that the agent must make: whether to comply
with or deviate from the norm, and whether to pressure others to comply
or to deviate.


6 In one experiment, we also allow for a fourth possibility: false believers who privately
support but publicly oppose the norm.


Compliance


The compliance decision is based on the level and direction of social
pressure relative to the strength S_i of an agent’s convictions, where
0 ≤ S_i ≤ 1. More precisely, we model agent i’s decision to comply with the
norm (C_i) as a binary choice, where C_i = 1 if i chooses to comply and
C_i = -1 otherwise. Social pressure is defined as the sum of enforcement
decisions by i’s neighbors. Each neighbor j who enforces the norm
(E_j = 1) increases the pressure on i to comply, and each neighbor who
enforces deviance (E_j = -1) increases the pressure to deviate. A positive
net value means that there is greater pressure in the direction of
compliance, while a negative net value promotes deviance. A disbeliever
complies with the norm if social pressure overcomes the agent’s opposition,
given the strength of the agent’s conviction:


\[
C_i =
\begin{cases}
-B_i & \text{if } \dfrac{-B_i}{N_i} \sum_{j=1}^{N_i} E_j > S_i, \\[1ex]
B_i & \text{otherwise.}
\end{cases}
\tag{1}
\]

Equation (1) states that an agent can be expected to violate its belief (such
that a disbeliever complies and a believer deviates) if and only if the
proportion of neighbors enforcing falsification minus the proportion
enforcing the opposite is sufficient to overcome the strength of the agent’s
conviction.⁷ Thus, in order for opponents of the norm (B_i = -1) to be
pressured into compliance (C_i = 1), positive social pressure must exist,
and for supporters of the norm (B_i = 1) to be pressured into deviance
(C_i = -1), negative social pressure is necessary.
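
As a rough illustration, equation (1) can be written as a short function.
This is a sketch in Python, not the authors' implementation; the function
name `comply` and the representation of a neighborhood as a list of E_j
values are assumptions made here for illustration.

```python
def comply(belief, conviction, neighbor_enforcement):
    """Return C_i as in equation (1): +1 to comply, -1 to deviate.

    belief               -- B_i, either +1 (believer) or -1 (disbeliever)
    conviction           -- S_i, between 0 and 1
    neighbor_enforcement -- list of E_j values (+1, 0, or -1), one per neighbor
    """
    n = len(neighbor_enforcement)
    # Net share of neighbors pushing the agent to act against its own belief.
    pressure_to_falsify = -belief * sum(neighbor_enforcement) / n
    if pressure_to_falsify > conviction:
        return -belief  # public behavior contradicts private belief
    return belief       # public behavior matches private belief


# A disbeliever with S_i = 0.19 and three of eight neighbors enforcing
# compliance: 3/8 = 0.375 exceeds 0.19, so the agent falsely complies.
print(comply(-1, 0.19, [1, 1, 1, 0, 0, 0, 0, 0]))  # 1
```

With S_i = 1 the inequality can never hold, because the pressure term cannot
exceed 1, which is why maximal conviction makes an agent immune to pressure.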

By default, we assume that true believers (B_i = 1) have maximal
conviction (S_i = 1) and therefore always comply, even if all their neighbors
enforce deviance. For disbelievers, however, convictions are not so strong
and may be overcome if the social pressure is sufficiently positive.

The public compliance decision intersects with private beliefs to create
four agent types: true believers and true disbelievers (whose public
behavior conforms to their private beliefs), and false believers and false
disbelievers (whose public behavior differs from their private beliefs, in
response to the effects of social pressure). By default, we assume that true
believers cannot become false disbelievers, because their convictions are
too strong to be overcome by any amount of pressure to deviate. However,
we also explore a special case in which private beliefs of false believers
(B_i = -1) can eventually conform to public behavior (C_i = 1), creating
“converts” with B_i = 1. These newly converted believers have weaker
convictions than those of the original true believers and could eventually
be pressured to “flip-flop” on the issue. Equation (1) thus also allows for
the possibility of a “false disbeliever” and shows the level of negative
pressure required to flip these converts (now with B_i = 1) back to their
original behavior (C_i = -1).


7 Although the model is dynamic and decisions are iterated, the model is not indexed
on time because we assume asynchronous updating, which means that agents decide
to comply and then enforce based on the pressures that exist at the moment of the
decision. Their decisions then alter the conditions on which other agents (choosing
later) base their decisions. To avoid order effects, we randomize the sequence in which
agents make decisions. To test robustness, we also replicated our experiments using
synchronous updating (in which all agents update their decisions at the same time,
based on conditions that existed at the end of the previous iteration) and found no
qualitative differences in the results.


Enforcement 


An agent’s enforcement decision is informed by concerns that differ
depending on whether the agent’s enforcement is true or false. Since false
believers are secretly opposed to the norm, they have no interest in
pressuring others to comply with a norm that goes against their private
convictions. Unlike true believers, who sanction to promote compliance, and
true disbelievers, who sanction to promote deviance, the false believer
sanctions to avoid exposure as an opportunistic imposter. The tendency
to falsely enforce thus increases with increasing social pressure to support
the norm and decreases with increased conviction, exactly as with the
decision to comply. However, we assume that enforcement imposes an
additional cost, beyond the costs associated with compliance.⁸ Thus, the
threshold for false enforcement is higher than the threshold for compliance
by an amount corresponding to the additional cost K incurred by those
who choose to enforce as well as comply, where 0 < K < 1:


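\[
E_i =
\begin{cases}
-B_i & \text{if } \left( \dfrac{-B_i}{N_i} \sum_{j=1}^{N_i} E_j > S_i + K \right) \wedge (B_i \neq C_i), \\[1ex]
+B_i & \text{if } (S_i W_i > K) \wedge (B_i = C_i), \\[1ex]
0 & \text{otherwise.}
\end{cases}
\tag{2}
\]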


The top line of equation (2) is identical to equation (1), except that greater
social pressure is required to induce false enforcement than to induce false
compliance, by the amount K, corresponding to the added cost of
enforcement. A false believer (B_i ≠ C_i) can be expected to enforce a norm
that the agent privately opposes if and only if social pressure is sufficiently
positive (i.e., supporters of the norm outweigh the opponents) to overcome
the agent’s reluctance to enforce, given both the strength of the agent’s
private opposition to the norm and the cost of enforcement.


8 To simplify the model, we omit cost from the compliance algorithm and treat the
cost of enforcement as the added cost of also enforcing a norm with which one is in
compliance. This added cost might be relatively small for sanctions based on social
approval or quite large for those based on material rewards or any form of disapproval
that carries a risk of inciting retaliation.

Only “posers” feel the need to affirm the sincerity of their false behavior
by pressuring others to act likewise. For everyone else, the decision to
enforce conformity to their belief is more straightforward. This decision
is based on one’s interest in having others conform with the way one
wants them to act, given one’s private belief (B_i) and the strength of one’s
conviction (S_i). Thus, the effect of conviction on true enforcement is the
opposite of the effect on false enforcement. Conviction inhibits false
enforcement (thus S appears on the right-hand side of line 1 in eq. [2]), while
conviction promotes true enforcement (thus S appears on the left-hand
side of line 2). We assume further that the effect of conviction depends
on the need for enforcement. Agents only pay the cost of enforcement
when it is needed to promote the desired behavior, and they stop enforcing
when there is sufficient conformity with their beliefs that enforcement is
no longer warranted. For example, if everyone is complying, true believers
have no need to enforce, even though their convictions are maximally
strong. The higher the level of deviance among their neighbors, the greater
the need of true believers to invest effort in a campaign of social control.
Similarly, the higher the level of compliance among a disbeliever’s
neighbors, the greater the need for a disbeliever to invest effort to pressure
others into noncompliance with the norm.

The need for enforcement (W_i) is simply the proportion of i’s neighbors
whose behavior does not conform with i’s beliefs (B_i), or

\[
W_i = \frac{1 - (B_i / N_i) \sum_{j=1}^{N_i} C_j}{2}.
\tag{3}
\]

Equation (3) rescales the aggregation of C over i’s N_i neighbors j (which
ranges from -1 when all j deviate to +1 when all j comply), so that the
result corresponds to the proportion of neighbors whose compliance
behavior does not conform with i’s belief B_i (which ranges from 0 when all
j conform to i’s belief to 1 when all j violate i’s belief). Substituting for
W_i in the middle line of equation (2), the algorithm implies that agents
enforce conformity with their private beliefs when conformity falls below
the level they are willing to tolerate, given their belief in the norm and
the strength of their conviction.

We assume that agents can only enforce compliance if they have also
complied, and they can only enforce deviance if they have also deviated.⁹
However, given the cost of enforcement (K), it remains possible to comply
with a norm but fail to enforce it and to deviate without enforcing
deviance. Thus, unlike the complementary rates of compliance and deviance,
the rates of enforcement for and against the norm need not sum to unity.
Everyone must choose whether to comply with the norm, but it is possible
that no one enforces anything (E = 0).
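
Equations (2) and (3) can be sketched together in a few lines of Python. This
is an illustrative reading of the equations rather than the authors' code; the
function names, the argument order, and the list-based neighborhoods are
assumptions, and the default cost of 0.125 is the value of K reported below.

```python
def need_for_enforcement(belief, neighbor_compliance):
    """W_i as in equation (3): share of neighbors whose C_j violates B_i."""
    n = len(neighbor_compliance)
    return (1 - belief * sum(neighbor_compliance) / n) / 2


def enforce(belief, conviction, complied, neighbor_enforcement,
            neighbor_compliance, cost=0.125):
    """Return E_i as in equation (2): +1, -1, or 0 (no enforcement)."""
    n = len(neighbor_enforcement)
    if belief != complied:
        # Top line: false enforcement needs pressure above S_i + K.
        pressure = -belief * sum(neighbor_enforcement) / n
        return -belief if pressure > conviction + cost else 0
    # Middle line: true enforcement when conviction times need exceeds K.
    need = need_for_enforcement(belief, neighbor_compliance)
    return belief if conviction * need > cost else 0


# True believer (B = 1, S = 1) surrounded by deviants: W = 1, so it enforces.
print(enforce(1, 1.0, 1, [0] * 8, [-1] * 8))  # 1
# The same believer surrounded by compliers: W = 0, so it stops enforcing.
print(enforce(1, 1.0, 1, [0] * 8, [1] * 8))   # 0
```

In both branches a nonzero result equals the agent's own compliance decision
C_i, so the sketch respects the rule that agents enforce only the behavior
they themselves display.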

Equations (1)-(3) require start-up assumptions about the initial levels
of compliance and enforcement. A conservative assumption in a test of
false enforcement is that agents initially conform to their private
convictions and no one enforces anything. Hence, there is initially no pressure
to comply, nor any pressure to enforce falsely.

We initialized the population as either fully connected or clustered into
small, ego-defined “neighborhoods” such that each agent has eight
neighbors. We fixed the distribution of beliefs so that the norm could range in
popularity from zero to nearly universal. True believers’ convictions were
fixed at 1.0, meaning that they were immune to social pressure. For
disbelievers, convictions were uniformly distributed in the range 0 < S_i <
.38 (slightly above three out of eight neighbors), with a mean of 0.19.
Although the mean conviction of true believers (S_i = 1) is over five times
that of disbelievers, they are also outnumbered 100 to 1, making the norm
highly unpopular overall. For a disbeliever with minimal conviction to
be pressured into compliance, there must be at least one more neighbor
enforcing compliance than enforcing deviance. A disbeliever with
maximal conviction (S = .38) and eight neighbors requires at least four more
neighbors enforcing compliance than enforcing deviance in order to be
pressured into compliance. The threshold for false enforcement is higher
than the threshold for compliance by the value of K, which we fix at .125.
For agents with eight neighbors, this means that at least one additional
neighbor must enforce before a false believer becomes a false enforcer as
well.

9 While agents may hypocritically enforce a norm they privately question, they never
enforce norms which they violate. We leave for future research the effect of hypocritical
enforcement based on the rule “Do as I say and not as I do” (Heckathorn 1989).


These values of conviction and cost form a cumulative uniform
distribution of false enforcement thresholds across the population of
disbelievers, such that, for agents with eight neighbors, if two more neighbors
are enforcing compliance than are enforcing deviance in every
disbeliever’s neighborhood, then about one-third of the disbelievers will falsely
enforce. If three more neighbors enforce compliance than deviance, then
about two-thirds of disbelievers will falsely enforce. And if four more
neighbors enforce compliance than deviance, then just under 99% of
disbelievers will falsely enforce.
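
These fractions follow from simple arithmetic, which the short Python sketch
below reproduces. It assumes, as stated above, that S_i is uniform between 0
and .38, that K = .125, and that each agent has eight neighbors; the function
name is hypothetical.

```python
def share_falsely_enforcing(net_enforcers, neighbors=8, cost=0.125,
                            max_conviction=0.38):
    """Share of disbelievers whose threshold S_i + K lies below the pressure.

    net_enforcers is the number of neighbors enforcing compliance minus the
    number enforcing deviance. With S_i uniform on (0, max_conviction), the
    share with S_i < pressure - K is (pressure - K) / max_conviction,
    clipped to the interval [0, 1].
    """
    pressure = net_enforcers / neighbors
    share = (pressure - cost) / max_conviction
    return min(1.0, max(0.0, share))


for m in range(1, 5):
    print(m, round(share_falsely_enforcing(m), 3))
# 1 0.0
# 2 0.329
# 3 0.658
# 4 0.987
```

A single net enforcer yields pressure exactly equal to K, so no disbeliever
enforces; two, three, and four net enforcers give roughly one-third,
two-thirds, and just under 99%.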

We tested several variations of equations (1)-(3) to assess the robustness
of the model, which uses a deterministic step function for the decisions
to comply and enforce. The step function implies that agents are
indifferent to changes in social influence at all levels of influence except for a
single critical value. We relaxed this assumption by replacing the step
function with a sigmoidal stochastic approximation (implemented as a
cumulative logistic function), in which the probability of the decision
increases with influence, such that the probability is 0.5 when the
threshold is crossed and approaches the natural limits asymptotically. The results
affirmed all conclusions based on the deterministic model. Moreover, with
stochastic decisions, cascades of false enforcement can be randomly
triggered in a population with no true believers, which is impossible in the
deterministic specification. However, the latter has the didactic advantage
of greater simplicity, and we therefore use the deterministic model for the
presentation of our results.
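
One way to picture the stochastic variant is the sketch below. The text
specifies only a cumulative logistic function that passes through 0.5 at the
threshold; the `steepness` parameter, its value of 10, and the function names
are assumptions introduced here, not values reported by the authors.

```python
import math
import random


def decision_probability(influence, threshold, steepness=10.0):
    """Logistic probability of deciding 'yes'; equals 0.5 at the threshold."""
    return 1.0 / (1.0 + math.exp(-steepness * (influence - threshold)))


def stochastic_decision(influence, threshold, rng, steepness=10.0):
    """Draw one decision; rng is a random.Random instance."""
    return rng.random() < decision_probability(influence, threshold, steepness)


rng = random.Random(42)
# Influence slightly below the threshold still triggers the decision
# some of the time, unlike the deterministic step function.
draws = [stochastic_decision(0.25, 0.30, rng) for _ in range(1000)]
print(sum(draws) / len(draws))
```

Because the probability is positive even when influence is below the
threshold, a run of chance decisions can start a cascade without any true
believers, which matches the behavior described above.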


EXPERIMENTAL DESIGN 


We want to know if a small minority of fanatics can force a few vulnerable
disbelievers (S_i ≈ 0) to help enforce the norm, which will then increase
the pressure on more stalwart disbelievers, triggering a cascade of false
enforcement that sweeps through a skeptical population. We also want
to see if this cascade process ultimately produces a stable enforcement
equilibrium that would persist even if every true believer were to exit the
population. To find out, we use a series of computational experiments to
study the emergent dynamics as we manipulate a set of structural
conditions:


1. access to information about the behavior of other agents (from global
to local, where local access limits enforcement pressure to neighbors
in a social network),

2. the frequency distribution and clustering of true believers (from less
than 1% to more than 99% of the population and from highly
clustered to randomly dispersed),

3. the network topology (from highly ordered networks to irregular
grids to small world networks with a much shorter characteristic
path length).


We explore network topologies using a two-dimensional cellular
automaton (von Neumann 1966) consisting of 1,000 agents located on a
40 × 25 torus, with every cell occupied by one agent.¹⁰ In a regular lattice,
each cell has a “neighbor” (or social tie) to the immediate right, left, top,
and bottom. This elementary structure, called the von Neumann
neighborhood, is characterized by the absence of transitivity (or triad closure,
in which one’s neighbors are also neighbors of one another). If we also
include the four adjacent diagonal neighbors (creating a Moore
neighborhood), we create ties among neighbors, a property of social networks
that is supported by research in structural balance theory (Cartwright
and Harary 1956).
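
The two neighborhood types can be sketched as follows for a 40 × 25 torus.
The row-major cell numbering and the function name are assumptions made for
this illustration.

```python
def lattice_neighbors(index, width=40, height=25, moore=True):
    """Return the neighbor cell indices of `index` on a wrapped (torus) grid.

    Cells are numbered row by row, so index = row * width + col.
    moore=True gives the eight surrounding cells; moore=False gives the
    four von Neumann neighbors (left, right, top, bottom).
    """
    row, col = divmod(index, width)
    if moore:
        offsets = [(dr, dc) for dr in (-1, 0, 1) for dc in (-1, 0, 1)
                   if (dr, dc) != (0, 0)]
    else:
        offsets = [(-1, 0), (1, 0), (0, -1), (0, 1)]
    # The modulo wraps edges around, so corner cells also have full sets.
    return [((row + dr) % height) * width + (col + dc) % width
            for dr, dc in offsets]


print(len(lattice_neighbors(0)))               # 8
print(len(lattice_neighbors(0, moore=False)))  # 4
# Cells 1 and 40 are both neighbors of cell 0. Only in the Moore
# neighborhood are they also neighbors of each other (triad closure).
print(40 in lattice_neighbors(1), 40 in lattice_neighbors(1, moore=False))
```

The last line prints `True False`, which is the transitivity difference
between the two structures described above.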

Although transitive social ties are more plausible than von Neumann
neighborhoods, the regularity of Moore neighborhoods remains highly
stylized. We relaxed the regularity of the lattice in two ways. First, we
created Voronoi diagrams or “irregular grids” (Flache and Hegselmann
2001) that preserve the transitivity and overlap of Moore neighborhoods
but allow agents to vary in degree (or number of neighbors). The results
using irregular grids were identical to those using Moore neighborhoods,
so only the results with the simpler regular structures are reported.

Second, following the procedure used by Watts (1999), we randomly
rewired ties with probability P (0 ≤ P ≤ 1). With P = 0, the network is
a regular lattice, and with P = 1 the network is completely random. An
intermediate regime, called a small world network, is characterized by a
high degree of clustering with a low characteristic path length and has
been shown to be representative of a large number of social and biological
populations (Watts and Strogatz 1998). Using the standard nine-cell Moore
neighborhood as our baseline, we explore alternative network topologies
and observe their effects on cascades of false enforcement.
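
A minimal sketch of random rewiring is given below. It is one simple way to
rewire each tie with probability P while keeping the number of ties constant,
and it may differ in detail from the procedure in Watts (1999); the function
name and the dictionary-of-sets network representation are assumptions.

```python
import random


def rewire(adjacency, p, rng):
    """Return a copy of an undirected network with ties rewired at rate p.

    adjacency maps each node to the set of its neighbors. Each tie (i, j)
    is, with probability p, detached from j and reattached from i to a
    randomly chosen node that is neither i nor a current neighbor of i.
    """
    new = {node: set(nbrs) for node, nbrs in adjacency.items()}
    nodes = list(new)
    edges = [(i, j) for i in nodes for j in adjacency[i] if i < j]
    for i, j in edges:
        if rng.random() >= p:
            continue  # keep this tie as it is
        candidates = [k for k in nodes if k != i and k not in new[i]]
        if not candidates:
            continue  # node i is already tied to everyone
        k = rng.choice(candidates)
        new[i].discard(j)
        new[j].discard(i)
        new[i].add(k)
        new[k].add(i)
    return new


# Example on a small ring of ten nodes, each tied to its two neighbors.
ring = {i: {(i - 1) % 10, (i + 1) % 10} for i in range(10)}
small_world = rewire(ring, 0.1, random.Random(7))
print(sum(len(v) for v in small_world.values()) // 2)  # 10 ties, as before
```

With p = 0 the network is returned unchanged, and larger values of p replace
a growing share of local ties with long-range ones.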


RESULTS 


To preview the main results, we report three notable findings on the effects
of network topology and the distribution and clustering of true believers:


1. Embeddedness. Although universal enforcement of a highly
unpopular norm is an equilibrium state in a fully connected population,
a dynamic model shows that this equilibrium cannot be reached from
out of equilibrium for a highly unpopular norm, supported only by
a very small number of true believers. Yet the equilibrium is easily
reached if an identical population is embedded in a network that
restricts interaction to small but overlapping neighborhoods.

2. Less is more. Under conditions in which even very large numbers
of true believers are unable to quash deviance by disbelievers, a
much smaller number of true believers can successfully ignite
cascades that lead to near-universal compliance and enforcement of the
norm. Simply put, we identified conditions in which five true
believers can accomplish what 500 cannot. By extension, when we
allow false enforcers to convert into true believers, it does not
stabilize the high-compliance equilibrium, but causes it to collapse.

3. Small worlds. Bridge ties between otherwise distant
neighborhoods, such as the “weak ties” (Granovetter 1973) that have been
shown to promote the diffusion of information, innovations, rumors,
and disease, are shown to inhibit cascades of false enforcement.


10 We also tested lattices with up to 5,000 cells. Increasing the size of the network had
no effects on the results.


Experiment 1: Effects of Embeddedness


We begin by testing the effects of network embeddedness on cascades of
enforcement of a highly unpopular norm (less than 1% of the population
are true believers). Experiment 1 manipulates the distribution (random
vs. clustered) and embeddedness (global vs. local) of true believers and
measures the effects on the proportion of disbelievers who falsely comply.
Since clustering is meaningless in a fully connected population, these two
manipulations yield three experimental conditions: global, local +
clustered, and local + random.

Before turning to the results of experiment 1, it is useful to bear in
mind that in a fully connected population, even with no true believers,
universal enforcement of the norm is an equilibrium. That is, if we
initialize the model such that everyone is falsely enforcing compliance with
the norm, then no agent will unilaterally change strategy and become the
lone deviant in a population of enforcers.
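
The equilibrium claim can be checked directly from equations (1) and (2). The
sketch below assumes 1,000 disbelievers with convictions drawn uniformly
between 0 and .38 and K = .125, as in the model description; the function name
and the random seed are arbitrary choices made for illustration.

```python
import random

COST = 0.125  # K, the cost of enforcement given in the text


def stays_put(conviction, n_others):
    """True if a falsely enforcing disbeliever would not change strategy.

    The agent has B = -1 and currently complies and enforces. All n_others
    agents enforce the norm (E_j = +1), so the net pressure equals 1.
    """
    belief = -1
    pressure = -belief * (n_others * 1) / n_others
    keeps_complying = pressure > conviction          # equation (1)
    keeps_enforcing = pressure > conviction + COST   # equation (2), top line
    return keeps_complying and keeps_enforcing


rng = random.Random(0)
convictions = [rng.uniform(0.0, 0.38) for _ in range(1000)]
print(all(stays_put(s, n_others=999) for s in convictions))  # True
```

Since the largest possible threshold, .38 + .125 = .505, is well below a
pressure of 1, every agent keeps complying and enforcing.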

Nevertheless, the thin solid line in figure 1 shows that this equilibrium
cannot be reached in an unembedded population that is overwhelmingly
skeptical (less than 1% true believers) if the starting point is far from
equilibrium. Figure 1 reports the proportion of disbelievers who comply
with the norm as we increase the proportion of true believers from less
than 1% to more than 99%. For now, we focus only on the far left side
of figure 1 (expanded in the right-hand corner insert) as we compare
compliance rates among disbelievers in an overwhelmingly skeptical
population that differs only in embeddedness and clustering.

FIG. 1. Effects of distribution and embeddedness of true believers (based on 1,000 agents
and averaged over 100 replications at each parameter combination). The dark line indicates
an embedded population in which the true believers are randomly distributed. The dashed
line shows an embedded population with clustered true believers, and the thin line shows
an unembedded population. The dashed line shows how a very small group of true believers
must be clustered to produce sufficient pressure to ignite a cascade. As we increase their
numbers, dispersed true believers can randomly cluster to form a critical mass (dark line).
True believers have little effect on disbelievers in an unembedded population unless they
comprise about one-third of the population (thin line). Above that level, their effectiveness
diminishes due to high rates of compliance they observe among one another, which
diminishes their zeal to enforce.


In a fully connected (unembedded) population (shown by the thin line
in fig. 1), each agent surveys every other agent to determine the levels of
compliance and enforcement of the norm. If the population is
overwhelmingly deviant at the outset, true believers are readily willing to pay the
cost of enforcement (K = .125), but their numbers are too small to compel
even the most spineless disbelievers to join their crusade. Had this been
the Andersen fable, everyone but the three rogues would be laughing at
the foolish emperor.
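
A back-of-the-envelope calculation shows why the crusade stalls. The sketch
below assumes five true believers among 1,000 fully connected agents (the
numbers used in the next condition) and the most favorable case in which no
disbeliever enforces deviance; the function name is hypothetical.

```python
COST = 0.125  # K, the cost of enforcement given in the text


def global_pressure(enforcing_for, enforcing_against, n_agents):
    """Net pressure on one agent when everyone observes everyone else."""
    return (enforcing_for - enforcing_against) / (n_agents - 1)


pressure = global_pressure(5, 0, 1000)
print(round(pressure, 4))     # 0.005
# False enforcement requires pressure > S_i + K. Even with S_i = 0 the
# threshold is K = 0.125, far above the pressure five enforcers can exert.
print(pressure > 0.0 + COST)  # False
```

In an eight-neighbor network, by contrast, the same handful of enforcers can
make up three-eighths of one agent's neighborhood.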

So how did it happen that the citizens did not laugh, but instead
expressed fawning admiration? We hypothesize that the explanation centers
on the assumption that social influence is not global but local.

To test this idea, we restricted agents’ observations to their immediate
neighbors and randomly distributed five true believers across a population
of 995 skeptics. The result (indicated by the dark line in fig. 1) is similar
to what we observe when information is global: everyone still laughs at
the emperor. Once again, the true believers fail to ignite a cascade.

But now suppose that agents are clustered by beliefs, a network
property called homophily, based on the principle that likes attract (Simmel
1955, Carley 1991, Mark 1998, McPherson, Smith-Lovin, and Cook 2001).
When agents are clustered by belief rather than randomly distributed, a
skeptic exposed to a true believer can now expect to have other true
believers in its neighborhood. The results (shown by the dashed line in
fig. 1) now change dramatically, and the outcome predicted by Andersen’s
fable becomes almost unavoidable. When true believers are clustered,
even a very small number can spark a cascade of compliance.

To test the robustness of this result, we repeated the experiment, but
this time we matched the true believers with an equal number of
disbelievers who had equally strong convictions (S = 1). For the remaining
disbelievers, S was uniformly distributed so that the average conviction
of all disbelievers remained .19. Creating an equal number of “diehards”
on both sides (while keeping true believers badly outnumbered) had no
effect on the results. The fanatic disbelievers could not be converted and
always enforced their true beliefs, yet they were powerless to stop cascades
of false enforcement.

We also repeated the experiment using a stochastic version of the model,
in which the probabilities of compliance and enforcement are cumulative
logistic functions of the level of influence specified in equations (1) and
(2), respectively. Remarkably, the results showed that no true believers
are needed: cascades of false enforcement can be triggered “spontaneously”
by the random waverings of uncommitted disbelievers.

To better understand the cascade dynamics, figure 2 gives a stylized
illustration using a neighborhood with a cluster of three true believers
(white cells 5, 8, and 11) surrounded by twelve disbelievers (light gray)
at time 1. Each true believer is exposed to at least six deviants, which is
sufficient to overcome the cost of enforcement (K = .125), and 5, 8, and
11 begin to pressure their neighbors at time 2 (turning dark gray).
Meanwhile, two disbelievers (7 and 9) are likewise exposed to the three true
believers, and this provokes them to fight back at time 2 by enforcing
opposition to the norm (turning black).

At time 2, the true believers now observe not only deviance but also
social pressure to deviate. However, given their strong convictions
(S = 1), this pressure has no effect on their resolve. They continue to
comply with the norm and to demand compliance of others. Conversely,
agents 7 and 9 now observe not only compliance but also pressure to
comply from three of eight neighbors (.375), causing them to falsify their
beliefs at time 3 (C_i ≠ B_i). With K = .125, we know from equation (2)
that this pressure is sufficient to trigger false enforcement by any agent
whose level of conviction is less than .25, which includes about two-thirds
of the disbelievers. Suppose agents 7 and 9 succumb to the pressure. Now
agents 4, 6, 10, and 12 will have three neighbors that are enforcing
compliance with the norm and zero neighbors enforcing deviance. Hence,
each of these agents will now be triggered to enforce falsely. By extension
of this process, at time 4, the entire Moore neighborhood (agents 4-12)
begins enforcing the norm (all dark gray).


FIG. 2. Cascade dynamics in a Moore neighborhood. The white cells are complying with ...
gray cells are also enforcing compliance, and the black cells are also enforcing deviance. ...
5, 8, and 11) compels neighboring disbelievers to falsely comply and falsely enforce. ...
soon not just their neighbors, but also their neighbors’ neighbors are all enforcing the ...
disbelievers sustain the unpopular norm.

At time 5, agents 1, 3, 13, and 15 only have two out of eight neighbors
enforcing (.25). This pressure is likely to persuade them to comply with
the norm but not to enforce it (since the enforcement threshold is higher
by K). However, agents 2 and 14 now have three of eight neighbors
enforcing the norm, and no pressure to deviate, so they will most likely
join the cascade. Thus, at time 6, with the additional pressure from agents
2 and 14, agents 1, 3, 13, and 15 now have three neighbors enforcing
compliance, and they too join in. The level of compliance is now sufficient
that none of the true believers need to enforce any longer. Like the three
swindlers in the Andersen fable, they can quietly leave the village,
knowing the citizens will now do their work for them.
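The arithmetic behind this walkthrough can be checked with a short sketch.
It uses a simplified reading of equations (1) and (2), which are not
reproduced in this section: a disbeliever is assumed to comply when the
share of neighbors enforcing the norm exceeds the agent's conviction S, and
to enforce when that share exceeds S + K. The function names, the example
conviction of .2, and the uniform distribution of convictions on 0 to .38
are assumptions made for illustration.

```python
K = 0.125  # cost of enforcement, as given in the text

def complies(pressure, conviction):
    """Assumed rule: false compliance when pressure exceeds conviction S."""
    return pressure > conviction

def enforces(pressure, conviction, k=K):
    """Assumed rule: false enforcement when pressure exceeds S + K."""
    return pressure > conviction + k

three_of_eight = 3 / 8   # .375, pressure on agents 7 and 9 at time 2
two_of_eight = 2 / 8     # .25, pressure on agents 1, 3, 13, 15 at time 5

# Largest conviction that three enforcing neighbors can overcome.
max_conviction = three_of_eight - K

# Share of disbelievers below that level, ASSUMING convictions are
# uniformly distributed between 0 and .38 (illustrative assumption).
share_vulnerable = max_conviction / 0.38

s = 0.2  # hypothetical disbeliever with a moderate conviction
corner_complies = complies(two_of_eight, s)
corner_enforces = enforces(two_of_eight, s)
edge_enforces = enforces(three_of_eight, s)
```

Under these assumptions the hypothetical agent complies but does not enforce
when two of eight neighbors enforce, and begins to enforce once a third
neighbor joins, which is the step that carries the cascade outward.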


Experiment 2: Effects of the Number and Clustering of True Believers


Experiment 1 showed how even very small numbers of true believers can
trigger a cascade of false enforcement in an embedded population, but
only if they are sufficiently clustered in a neighborhood to generate
enough peer pressure to trigger a chain reaction. Experiment 2 further
explores what happens as we increase the proportion of true believers.

The thin line in figure 1 shows that this effect is U-shaped in a global
population. As the number of true believers approaches one-third of the
population, they can trigger a cascade in a fully connected population in
the same way that three true believers can trigger a cascade in a local
neighborhood of size 9. (Of course, one hesitates to refer to this as an
“unpopular” norm, considering that one-third of the population now has
an unwavering conviction in favor of norm compliance.)

However, as we increase the number of the true believers even further,
their enforcement effectiveness begins to diminish. This happens because
true believers have less need to enforce as the level of compliance increases,
which means that the dynamics are self-limiting. In contrast, false
believers feel more pressure to enforce as the cascade progresses; it is a
self-reinforcing dynamic. Thus, the larger the proportion of true believers
(who always comply), the lower the pressure on disbelievers. For example,
when there is an overwhelming majority of true believers (e.g., 90% of
the population), they observe that almost everyone is willing to comply
without their having to pay the costs of enforcement. Thus, true believers
do not bother to enforce their views, and the tiny minority of disbelievers
can enjoy an atmosphere of tolerance (but only so long as their numbers
do not attract too much attention from the majority).

We see a similar U-shaped effect in a population of embedded agents
with randomly distributed true believers (the dark line in fig. 1). However,
the governing dynamics are somewhat different. As the number of randomly
distributed true believers increases, there is a greater likelihood
that a critical mass of true believers will end up in the same neighborhood.
Moreover, the larger the number of such clusters, the greater the odds
that a cluster will form in a neighborhood with disbelievers whose
convictions are relatively low, such that a cascade can find a foothold. Thus,
cascades of false enforcement are more robust as the number of true
believers increases, but only up to a point. As in the unembedded case,
there can also be too many true believers for their own good. True believers
who are surrounded by compliance do not see the need to enforce, and
this situation allows fringe pockets of deviance to persist in a population
that is almost entirely compliant by conviction. Had there been fewer
true believers, they would have pressured nearby disbelievers to join the
cascade, who would have pressured other disbelievers, and so on, until
these pockets of dissent were eliminated by the self-reinforcing dynamics
of false enforcement.

Finally, we tested for the U-shaped effect in a population of agents
clustered by belief. When true believers are tightly clustered, it takes only
a very small number to trigger a cascade leading to universal enforcement,
as already noted. As their numbers increase, we might expect them to
become less effective, as observed for the global population and for a
local but randomly distributed population, due to the self-limiting
dynamics of true enforcement. However, when the true believers are densely
clustered, we observe no reduction in effectiveness as their numbers
increase.

This remarkable scalability is caused by the existence of an ordered
boundary between the two populations. The ratio of the two groups along
this boundary is constant, regardless of the relative sizes of the two
populations. Even when the norm is very popular, the true believers along
the border are exposed to high levels of deviance along the frontier, just
as they would be if they were a minority in an isolated neighborhood.
This contact motivates the true believers to enforce, and the cascade then
fans out across the cluster of disbelievers, like a prairie fire. This boundary
effect suggests a structural interpretation of Erikson's (1966, p. 19)
emphasis on the importance in social control of “those we call deviants [who]
patrol its boundaries.”

As an extension of the experiment on the number of true believers, we
also allowed for the possibility that agents might eventually change their
private beliefs to conform to their public behavior.⁶ We therefore added


⁶ This change of belief to conform to behavior is hypothesized to occur under conditions
where social pressure prevents agents from changing their behavior to conform to their

an algorithm that modifies agent i's private belief $B_i$ based on the weighted
cumulative experience of false enforcement, relative to the strength of
conviction:

$$R_i^{t+1} = \begin{cases} R_i^{t} - \lambda E_i^{t} B_i^{t} & \text{if } E_i^{t} \neq B_i^{t}, \\ 0 & \text{otherwise,} \end{cases} \tag{4}$$

$$B_i^{t+1} = \begin{cases} C_i^{t} & \text{if } (R_i^{t+1} > S_i) \wedge (B_i^{t} \neq C_i^{t}), \\ B_i^{t} & \text{otherwise.} \end{cases} \tag{5}$$


Equation (4) states that cognitive pressure to reconcile a dissonant belief
increases by $\lambda$ with each decision to falsely enforce ($E_i = -B_i$), is
unaffected if the decision is not to enforce ($E_i = 0$), and is reset to zero
should the agent truly enforce ($E_i = B_i$). Equation (5) states that private
belief $B_i$ is then reconciled with public behavior when the dissonance is
sufficient to overcome the “stubbornness” of i's belief, given the strength of
i's conviction ($S_i$). The learning parameter $\lambda$ ($0 \le \lambda < 1$)
controls how quickly private beliefs change to conform to public behavior. We
set $\lambda$ at .0001 so that the cascade process would have sufficient time
to complete before private beliefs begin to accommodate public behavior (with
$\lambda$ = .0001, nearly 3,800 updatings will be required before disbelievers
with $S_i$ = .38 will change beliefs).
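The belief-revision rule in equations (4) and (5) can be sketched as
follows. This is a minimal illustration rather than the authors' code: it
assumes that beliefs, compliance, and enforcement are coded as +1 and -1
(with 0 for no enforcement), which is inferred from the expression
E_i = -B_i, and the function names are hypothetical.

```python
def update_dissonance(r, e, b, lam):
    """Equation (4): accumulate pressure to reconcile a dissonant belief.

    r: current dissonance, e: enforcement (-1, 0, +1), b: belief (-1, +1).
    """
    if e == b:
        return 0.0           # true enforcement resets the pressure
    return r - lam * e * b   # e == -b adds lam; e == 0 leaves r unchanged

def update_belief(b, c, r_next, s):
    """Equation (5): adopt the publicly displayed position c when the
    dissonance exceeds conviction s and belief and behavior differ."""
    if r_next > s and b != c:
        return c
    return b

def steps_until_conversion(s, lam=0.0001):
    """Count updates for a disbeliever who keeps falsely enforcing."""
    b, c, e = -1, 1, 1       # disbelieves, but complies and enforces
    r, steps = 0.0, 0
    while b != c:
        r = update_dissonance(r, e, b, lam)
        b = update_belief(b, c, r, s)
        steps += 1
    return steps

updates_needed = steps_until_conversion(s=0.38)
```

With a conviction of .38 and a learning rate of .0001, the count comes out
at roughly .38 / .0001, in line with the “nearly 3,800 updatings” mentioned
in the text.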

Note that this experiment differs from the previous manipulation in
which we increased the number of true believers prior to the cascade.
Once the cascade has succeeded, one might expect increasing support for
the norm to simply stabilize the equilibrium of norm compliance.
Surprisingly, the effect is exactly the opposite. Figure 3 shows what happens
as false enforcers begin to believe in what they are preaching: the
unpopular norm destabilizes and eventually collapses.

The time series in figure 3 illustrates the emergence of an unpopular
norm in an embedded population with clustered true believers, followed
by the decay of false enforcement. Agents with weaker convictions are
among the first to be converted to false enforcers. As equation (5) indicates,
they also have a high susceptibility to eventually being converted into
true believers. At some point after the population reaches an equilibrium
of norm enforcement, weak-willed disbelievers, who have been falsely
enforcing for the longest time and are the least committed to their beliefs,
begin to convert. As soon as disbelievers become true believers they are
no longer complying because of social pressure and thus no longer feel
the need to prove the sincerity of their compliance by publicly pressuring
others. Nor is the weak-willed true believer willing to enforce out of
conviction unless the compliance level approaches zero (eq. [2]). When
these converted disbelievers stop enforcing the norm, there is less local
pressure on the remaining disbelievers, which also allows those with
stronger convictions to stop enforcing, further reducing local pressure. A new
equilibrium then obtains, characterized by behavior that is largely
voluntary, with minimal enforcement (limited to a few of the original
high-conviction true believers, true disbelievers with high convictions located
near them or their converts, and false disbelievers who enforce out of
pressure from their true disbeliever neighbors).

FIG. 3. Effect of conversion of disbelievers (based on a representative time series for
1,000 agents). The solid line indicates norm compliance and the dashed line indicates norm
enforcement. The norm quickly spreads, and by time 80 the entire population is enforcing
the norm except the true believers, whose efforts are no longer needed. Disbelievers with
low convictions eventually begin to change their private beliefs, becoming believers in the
norm. With no need now to hide their posturing, they stop enforcing, which reduces pressure
on other disbelievers. At time 560, there is a steep reverse cascade from false enforcement
to widespread deviance, and by time 600, two-thirds of the population is refusing to comply.

beliefs. Significant research on cognitive dissonance and self-perception theories points
to this assimilative effect of behavior on beliefs (Festinger 1957; Bem 1972). Prentice
and Miller's research on unpopular drinking norms showed a tendency for male
students' behaviors and attitudes to gravitate over time toward what they falsely thought
to be the norm (1993). As William James said, “We need only ACT as if the thing in
question were real, and keep acting as if it were real, and it will infallibly end by
growing into such a connection with our life that it will become real” (James [1890]
1981, emphasis in original).

Once support for the unpopular norm collapses, a new cascade of false
enforcement does not begin again. In order to start a new cascade, the
original true believers need deviant neighbors with low convictions who
can be pressured into false enforcement (to prove their sincerity). However,
those with relatively low convictions have now been converted and are
no longer deviant. Although the original true believers still have strong
convictions, their newly converted but weak-willed neighbors now comply
with the norm, but their convictions are not strong enough to motivate
them also to enforce their views. The conversion of private beliefs has
robbed the true believers of the spineless imposters that are needed to
launch a successful cascade, the tinder needed for the fire. Paradoxically,
although the norm is not as unpopular as before, the population has been
inoculated against another cascade by the conversion of those with low
convictions.

Note the contrast with the collapse in Andersen's fable. In the original
story, the spell collapses when a child laughs at the naked emperor. Our
model shows that the spell can also collapse for the opposite reason:
when the adult sycophants actually begin to believe they can see the
emperor's clothes. This is a counterintuitive but logical possibility that
has gone largely unnoticed in the literature on social control.


Experiment 3: Effects of Network Topology


As a test of the robustness of our results across network topologies, we
repeated experiments 1 and 2 using von Neumann neighborhoods as well
as Moore neighborhoods of varying depth. As an additional robustness
test, we used a Voronoi diagram that relaxes the spatial regularity of
lattice networks and allows degree to vary over the population of agents,
as illustrated in figure 4. Across all these network structures, the results
were qualitatively the same, indicating that the governing dynamics do
not depend on network regularity (which is removed in the Voronoi
diagrams) or clustering (which is zero in von Neumann neighborhoods and
increases with neighborhood depth in Moore geometries).

Further testing revealed that the spread of false enforcement does
depend on network structure, but in a way that is quite surprising. A growing
body of theoretical research demonstrates that increasing the proportion
of random ties in a regular network dramatically increases the propagation
rate of cascades (Newman 2000; Kleinberg 2002). We therefore tested the
effect on cascades as we perturb network order using Watts's “rewiring”
technique, replacing local ties with random ties with probability P. Watts
shows how a very small number of random bridge ties between otherwise
distant pairs is sufficient to drastically reduce the characteristic path
length, with minimal reduction in clustering. The world remains “small”
(i.e., highly clustered) but with remarkably few “degrees of separation”
between any two randomly chosen nodes.

FIG. 4. Voronoi diagram. The shaded cells are neighbors of the clear cell in the center
of the neighborhood. The network retains the spatially ordered relations of grid structures
but without their structural regularity.

One might then expect that random rewiring would also promote the
spread of false enforcement of unpopular norms. The surprising result is
that random rewiring not only failed to noticeably increase the rate of
propagation, but it actually inhibited cascades as the level of randomness
increased above a critical value of P. Figure 5 illustrates the effect of
reducing network order on the fraction of disbelievers who enforce the
norm. For a significant part of the small world regime (P < .01), cascades
of false enforcement are as robust as in regular graphs (P = 0). However,
for values of P > .01, there is a noticeable decline in the magnitude and
frequency of cascades, and for P > .1 cascades are entirely precluded.
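The rewiring manipulation can be sketched as below. This is one simple
reading of the technique, not the authors' implementation: it assumes a
torus lattice with Moore neighborhoods, keeps one endpoint of each tie, and
redraws the other endpoint at random with probability P. The function names
and the 10-by-10 lattice size are illustrative choices.

```python
import random

def moore_lattice(n):
    """Undirected ties of an n-by-n torus; each cell has 8 Moore neighbors."""
    ties = set()
    for r in range(n):
        for c in range(n):
            a = r * n + c
            for dr in (-1, 0, 1):
                for dc in (-1, 0, 1):
                    if (dr, dc) != (0, 0):
                        b = ((r + dr) % n) * n + (c + dc) % n
                        ties.add((min(a, b), max(a, b)))
    return ties

def rewire(ties, num_nodes, p, rng):
    """With probability p, replace tie (a, b) by a tie from a to a random node."""
    current = set(ties)
    for a, b in sorted(ties):
        if rng.random() < p:
            current.remove((a, b))
            while True:
                v = rng.randrange(num_nodes)
                new_tie = (min(a, v), max(a, v))
                # Skip self-ties, the old partner, and existing ties.
                if v != a and v != b and new_tie not in current:
                    current.add(new_tie)
                    break
    return current

lattice = moore_lattice(10)                              # 100 agents
unchanged = rewire(lattice, 100, 0.0, random.Random(1))  # P = 0
perturbed = rewire(lattice, 100, 1.0, random.Random(1))  # P = 1
```

The number of ties is held constant, so any change in cascade behavior under
this scheme reflects where ties point rather than how many there are.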

FIG. 5. Effect of network perturbation (based on 1,000 agents, averaged over 100
replications at each setting). A logarithmic scale on the horizontal axis highlights the steep
drop-off in the spread of false enforcements as P is increased. The sharp increase in the
effect of P at about .01 indicates a phase transition. Below that level, random ties have
little effect, and for P > .1 cascades do not occur at all.

The unexpected inhibiting effect of random rewiring is due to the
decreasing fraction of overlap between neighborhoods as network order
decreases, as illustrated in figures 6 and 7. Suppose node A in figure 6 is
activated, along with all of A's eight neighbors. With P = 0, nonadjacent
agents A and B share three common neighbors, giving them an overlap
of .375. So long as B's activation threshold is below .375, the cascade is
guaranteed to move beyond A's neighborhood, even if none of B's other
neighbors are activated.

Figure 7 shows how rewiring reduces the average overlap between
neighborhoods. A is now connected to otherwise distant nodes, creating
a shorter characteristic path length for the network. However, A and B
now have only one neighbor in common instead of three. Pressure from
A's neighborhood may be sufficient to induce B to comply with the norm,
but with K = .125, it is not sufficient to induce B to enforce. More
generally, as the network is perturbed, either by rewiring existing ties or by
adding new ties, the average overlap between neighborhoods decreases,
reducing the chance that a neighborhood of enforcers will have sufficient
overlap with nearby neighborhoods to propagate the cascade.

FIG. 6. Regular neighborhoods. The shaded cells are A's neighborhood (enforcing the
norm), the hashed cells are B's neighborhood (deviants), and the three shaded hashed cells
are the common neighbors of A and B (enforcing the norm). With K = .125, if the strength
of B's conviction is below .25, the pressure from these three neighbors will be sufficient to
make B enforce as well.
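The overlap figures quoted for figures 6 and 7 can be reproduced with a
small sketch. The grid coordinates, the placement of A and B two columns
apart, and the particular ties that are rewired away are assumptions chosen
for illustration; the enforcement rule (pressure must exceed S + K) is the
same simplified reading of equation (2) used in the text's own arithmetic.

```python
def moore_neighbors(cell):
    """The eight cells surrounding a cell on a square grid."""
    r, c = cell
    return {(r + dr, c + dc)
            for dr in (-1, 0, 1) for dc in (-1, 0, 1)
            if (dr, dc) != (0, 0)}

def overlap(a_neighbors, b_neighbors):
    """Fraction of B's neighbors that are also neighbors of A."""
    return len(a_neighbors & b_neighbors) / len(b_neighbors)

K = 0.125
A, B = (0, 0), (0, 2)    # nonadjacent cells, two columns apart (assumed)

# Regular lattice: A and B share the three cells in the column between them.
regular = overlap(moore_neighbors(A), moore_neighbors(B))

# Perturbed lattice: two of A's local ties are replaced by ties to distant
# cells (hypothetical coordinates), leaving one common neighbor.
a_rewired = (moore_neighbors(A) - {(-1, 1), (1, 1)}) | {(50, 50), (60, 60)}
perturbed = overlap(a_rewired, moore_neighbors(B))

# Highest conviction B could hold and still be pushed into enforcing.
max_conviction_regular = regular - K
max_conviction_perturbed = perturbed - K
```

In the perturbed case the ceiling on B's conviction drops to zero, so under
these assumptions no disbeliever with a positive conviction can be pressured
into enforcing by A's neighborhood alone.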


DISCUSSION 


The results we report, across all three experiments, tell a very similar
story: cascades of false enforcement of an unpopular norm depend on the
spread of misinformation about the distribution of support for the norm.
Unpopular norms thrive on local misrepresentations of the underlying
population distribution. Simply put, it is a sampling problem. This is
more easily appreciated if we examine various conditions in which we
found that cascades fail:


1. in an unembedded (fully connected) population,

2. in an embedded population with a small number of randomly
dispersed true believers,

3. when random ties reduce the overlap between local neighborhoods.


In the first case, the cascade fails because disbelievers have an accurate
estimate of the true distribution of public support for the norm. Similarly,
in the second case, the cascade fails because the distribution of true
believers in each neighborhood does not deviate very far from the underlying
population distribution. Hence the information is not sufficiently
inaccurate to persuade disbelievers that the norm is far more popular than it
really is.

In the third case, random rewiring gives disbelievers access to
information from outside their Moore neighborhoods. These random ties to
otherwise distant nodes give disbelievers a more representative sample
and thus a more accurate picture of the true state of the world (see Merton
1968; O'Gorman 1986). Although we discuss this effect as the inhibition
of the diffusion of social pressure, we could also frame the effect as the
promotion of the diffusion of information. Random ties inhibit cascades
of misinformation by providing access to a more representative sample
of the true population.⁷

FIG. 7. Perturbed neighborhoods. A and B now have only one neighbor in common out
of eight (overlap is .125). With K = .125, B can no longer be pressured into enforcing the
norm by members of A's neighborhood.

The results also converge around a second theme, one that calls into
question the conventional wisdom among sociologists ranging from
functionalists to utilitarians. Despite deep differences in their theoretical
approaches, the functionalist and choice-theoretic accounts converge around
the prevailing idea that norms are enforced because they are useful, either
to society at large (in functionalist accounts) or to those who enforce them
(in choice theory). A prominent social scientist sums up the consensus
position very succinctly by noting that “norms of social behavior” can be
viewed as “reactions of society to compensate for market failures.” In
effect, when markets create a mess of things, society creates norms with
which to clean it up. Was this statement from a functionalist? On the
contrary, it was the economist Kenneth Arrow (1971, p. 22), expressing a
view recently echoed and underscored by game theorists Bowles and
Gintis (2001, p. 6) and by Hechter and Opp (2001, p. xvi). “The view that
norms are created to prevent negative externalities, or to promote positive
ones,” write Hechter and Opp, “is virtually canonical in the rational choice
literature.”

The emperor's dilemma tells a different story, one that invites us to
revisit our sociological intuitions about how and why norms emerge and
spread. Contrary to both functionalist and choice-theoretic accounts,
norms do not necessarily solve social dilemmas, correct market failures,
or promote social welfare. Norms can also create social dilemmas and
undermine social welfare. When that happens, the real culprits are not
the true believers, whose motivation for enforcement increases with the
level of deviance. The key to the emergence of an unpopular norm is the
cascade process centered on the self-reinforcing motivations of those who
succumb to social pressure. Like a witch hunt, the process can quickly
spiral up into a powerful (and dangerous) social movement.

⁷ In a study of norm perception among five vegetarian communities, Kitts (2003) found
support for the hypothesis that local interaction shapes the accuracy of norm
perception. However, his research showed that structures encouraging local interactions led
to more accurate norm perception because individuals were more likely to disclose
their deviance to a friend, and close friendships tend to be clustered.


A CAUTIONARY CONCLUSION 


We have used a highly specific model of social influence to explore the
propagation dynamics of unpopular norms. We do not mean to suggest
that we believe this is the only possible specification of social influence.
Rather, we used this model to show how these cascades are highly sensitive
to the structural conditions in which they occur, and to show how
computational models are useful tools for getting leverage on embedded
dynamics that would otherwise be mathematically intractable. We believe
it will prove useful to explore alternative models of the influence process,
as a way to more precisely specify the loosely used concept of “peer
pressure” as the motivation to enforce unpopular norms. Using agent-based
models, three modes of influence can be compared:


1. Conformist influence, which is based on the false belief that others
are correct, as in herd behavior (Banerjee 1992) and information
cascades (Bikhchandani, Hirshleifer, and Welch 1992). This process
applies to the spread of beliefs that are uncertain (Sherif 1936) rather
than unpopular (Asch 1951).

2. Imaginary enforcement pressure, which is based on the false belief
that those who comply will also enforce, when in fact they will not,
as in the Andersen fable, in which a single violation (e.g., a child
who laughs at the emperor) is sufficient to disturb a highly fragile
equilibrium.

3. Real enforcement pressure, which is based on false compliance and
false enforcement, as in the model used in this study.


Note that these elaborations of our model to include alternative
specifications of the influence dynamics are not intended to make the model
more realistic, nor do we know of any empirical studies that clearly favor
one mechanism over another. Rather, we propose these alternative models
as dynamic thought experiments in a program of systematic theoretical
research that explores a series of what ifs. The emperor's dilemma model
demonstrates the usefulness of this approach, especially for those
interested in structural determinants of microsocial influence processes
underlying puzzling population dynamics.

We have also limited ourselves to a computational model of these
dynamics, and here again, we do not mean to suggest that other modeling
techniques, such as game-theoretic approaches, would not provide
additional insights that we may have missed. Nevertheless, we encourage
readers to appreciate the particular strengths of this methodology.
Agent-based models of dynamic social interaction are more tractable (but less
generalizable) than mathematical modeling and more rigorous (but less
nuanced) than natural language (Hanneman, Collins, and Mordt 1995).
Expressed in natural language, a theory of enforcement of unpopular
norms has the intuitive appeal of Andersen's fable, but we cannot know
if the intuitions can be trusted, or which assumptions were necessary for
the results. A mathematical model is useful for identifying possible
equilibria in a fully connected or completely random network but not for
studying the dynamics of propagation across a population embedded in
a complex network structure. The latter turns out to be decisively
important for the spread of unpopular norms.


An Individual-Based Model of Innovation
Diffusion Mixing Social Value and Individual
Benefit¹


Guillaume Deffuant, Sylvie Huet, and Frédéric Amblard
Laboratoire d'Ingénierie pour les Systèmes Complexes


The authors propose an individual-based model of innovation
diffusion and explore its main dynamical properties. In the model,
individuals assign an a priori social value to an innovation, which
evolves during their interactions with the “relative agreement”
influence model. This model offers the possibility of including a
minority of “extremists” with extreme and very definite opinions.
Individuals who give a high social value to the innovation tend to
look for information that allows them to evaluate more precisely
the individual benefit of adoption. If the social value they assign is
low, they neither consider the information nor transmit it. The main
finding is that innovations with high social value and low individual
benefit have a greater chance of succeeding than innovations with
low social value and high individual benefit. Moreover, in some
cases, a minority of extremists can have a very important impact
on the propagation by polarizing the social value.


INTRODUCTION 


The agent-based model of innovation diffusion described in Deffuant
(2001) and Deffuant et al. (2002b) was initially targeted at the diffusion
of green practices among farmers. We applied it to different types of green
practices (e.g., landscape maintenance, reduction of inputs) from different
study zones in Europe. In this article, we present an evolution of this
model and consider its application to more general processes of innovation
diffusion like the mobile phone, the Internet, contraception, organic
products, genetically modified organisms, and cloning.

We refer to innovation diffusion in a sense close to the one proposed
by Valente (1995, p. 2): “Diffusion of innovations is the spread of new
ideas, opinions, or products throughout a society. Thus, diffusion is
a communication process in which adopters persuade those who have not
yet adopted to adopt.” More precisely, our model is designed for
innovations in which both social values and individual payoff are considered.
For instance, farmers give green practices promoted and supported by
the European Commission a social value according to their opinion about
environmental preservation and the image they have of farming. If they
are interested in the green practices, they try to get information to evaluate
the individual benefit of adoption, including an evaluation of the work
to be done and whether the subsidies cover the costs or not. The final
adoption is based on a trade-off between both social value and individual
benefit. We believe that such behavior is fairly general and can be
extended to other innovations. For instance, in considering the adoption of
a mobile phone, a social value can be taken into account as well as more
strictly individual benefits or risks for health.

The most popular model of innovation diffusion, the threshold model
(Granovetter 1978), already considers the trade-off between a social value
and an individual benefit from the innovation. In this model, the social
value is directly related to the proportion of adopters in the individual's
social network, representing the diffusion as a contagion process (Rogers
1983). When the number of adopters increases in the network of peers,
the pressure for adoption increases. The threshold is the proportion of
adopters in the individual's social network that is necessary to convince
him or her to adopt. This theory has been applied to very different
subjects: farming innovations, family planning practices, medical technology,
policy innovation, and language (see Rogers [1983] for a comprehensive
review).
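The threshold rule described above can be sketched in a few lines. This is
a generic illustration, not code from any of the cited works: the
five-person chain network, the synchronous updating, and the use of
“greater than or equal to” at the threshold are all assumptions, and
`threshold_diffusion` is a hypothetical name.

```python
def threshold_diffusion(network, thresholds, seeds):
    """Spread adoption until no one else's threshold is met.

    network:    dict mapping each individual to a list of contacts.
    thresholds: dict mapping each individual to the proportion of adopting
                contacts needed to convince him or her to adopt.
    seeds:      individuals who have adopted at the start.
    """
    adopters = set(seeds)
    while True:
        newly_convinced = {
            person for person, contacts in network.items()
            if person not in adopters and contacts
            and sum(c in adopters for c in contacts) / len(contacts)
            >= thresholds[person]
        }
        if not newly_convinced:
            return adopters
        adopters |= newly_convinced

# A chain of five individuals; only person 0 has adopted initially.
chain = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2, 4], 4: [3]}
spread = threshold_diffusion(chain, {p: 0.5 for p in chain}, seeds={0})
stalled = threshold_diffusion(chain, {p: 0.6 for p in chain}, seeds={0})
```

With thresholds of .5, one adopting contact out of two is enough and
adoption travels down the whole chain; raising every threshold to .6 stops
the process at the initial adopter.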

Threshold models were particularly used and studied in the social
simulation research field, which aims at reproducing social dynamics in
computer models (Gilbert and Conte 1995; Bousquet et al. 1993; Gilbert and
Troitzsch 1999), because threshold models fit exactly into
the most popular type of social simulation model: automata networks.
Automata networks can be used to simulate the threshold model of
innovation diffusion or new product growth in marketing. Each automaton
represents an individual in a social system, and the links of the graph
can represent working relations, friendship, or any type of contact; the
binary state of the automaton corresponds to the adoption or nonadoption
of the innovation. Blume (1993, 1995) and Ellison (1993) consider
automata networks implementing the threshold model, in which the threshold
is related to an intrinsic payoff to the considered agent. This leads to the
definition of the agent's utility as the sum of an intrinsic payoff and the
proportion of the agent's neighbors who adopted the behavior. Several
variants of this model can be found (Blume 1993; Young 1998). This
framework led to many interesting theoretical and practical results (Young
1999; Weisbuch and Boudjema 1999).
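The link between this utility formulation and a threshold can be written
out explicitly. The sketch below assumes, for illustration only, that an
agent adopts when utility is nonnegative; the symbols $a_i$ (intrinsic
payoff), $p_i$ (proportion of adopting neighbors), and $\theta_i$
(threshold) are introduced here and are not taken from the cited works.

```latex
U_i = a_i + p_i, \qquad
p_i = \frac{\text{number of adopting neighbors of } i}{\text{number of neighbors of } i}

\text{adopt} \iff U_i \ge 0 \iff p_i \ge -a_i \equiv \theta_i
```

Under that assumption, an agent with a more negative intrinsic payoff simply
has a higher threshold, which is why the two descriptions can be treated as
the same model.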

However, our study on agri-environment measures pointed out three
strong limitations to threshold models of innovation diffusion:


1. The most important weakness is the hypothesis that people have an
a priori knowledge of their individual benefit. This hypothesis is not
very realistic for complicated innovations. The analysis of farmers'
interviews revealed that only previously interested farmers invested
some time to take into account available information and to evaluate
their potential individual benefit. In over 350 interviews carried out
in nine different sites in Europe, 73% of nonadopters did not look
for more information after having heard for the first time about the
measures. On the contrary, 70% of adopters did look for information
after they first heard about them. Therefore, there is a need to
consider the specific dynamics of information propagation.

2. Moreover, the dynamics of mutual influence on social value are more
complex than a simple contagion effect. Farmers have more or less
strong convictions on their social values, and this affects their
influence capacity as well as their open-mindedness to other opinions. A
large body of work related to group polarization (see Moscovici and
Doise [1992] for a review) illustrates this complexity.

3. Finally, we noted that the decision is not binary (adoption versus
nonadoption). We observed several farmers in an intermediate
decision state in which their decision is uncertain.


We tried to incorporate these features in our model while keeping it as
simple as possible. Our modeling approach is a hybrid between the
cognitive agent approaches in computer science (Conte and Castelfranchi
1995; Ferber 1999; Muller 1996; Conte 1999) and the models of cellular
automata networks, which are more inspired by physics (Weisbuch 1991;
Blume 1993; Young 1998). It builds on earlier versions proposed by
Chattoe and Gilbert (1999). The inspiration in sociology relates clearly to the
field of innovation diffusion. The main features of the model are the
following:


1. The model represents dynamics of discussions in a social network
   of individuals. Individuals send messages to each other containing
   their social opinion and their information if available. The
   discussions are triggered by messages from the media that reach
   individuals at random, with a given frequency; the individuals then
   propagate the discussions in their network. We assume that the
   information enables individuals to evaluate the individual benefits of
   adopting an innovation.

2. We assume that individuals have an a priori opinion about the social
   value of an innovation, even the first time they hear about it. The
   dynamics on the social values are based on the “relative agreement”
   model that we studied earlier (Deffuant et al. 2001, Weisbuch et al.
   2002, Deffuant et al. 2002a). This model shows some similarity with
   the results described in Moscovici and Doise (1992): it can lead to
   polarization under the influence of a minority of extremists when
   there are a lot of discussions.

3. Only individuals having a high social opinion of the innovation pay
   attention to the information. They use it to perform an evaluation
   of their potential individual benefit in adoption and transmit it to
   their associates.

4. Individuals adopt only if a global evaluation, including the social
   value and the individual benefit, is good enough with some certainty.
   This implies that all adopters evaluated their potential individual
   benefit.


We perform a systematic study of the model: we observe the average
final number of adopters over several runs for different values of the main
parameters, in particular the definition of the a priori distribution of
social values and the function of individual benefit evaluation. The
analysis of the results is based on the observation of the social value
distribution and the information possession at the end of the simulation.
The main results of this analysis are:


1. In the model, an innovation with a high social value and a low
   individual benefit propagates better than an innovation with a low
   social value and a high individual benefit. The reason is that the
   propagation of information has a chance of being blocked in the
   latter case, which prevents people from evaluating their individual
   benefit.

2. A minority of extremists can strongly modify the adoption when the
   density of the social network and the frequency of discussion are
   high.

3. Low levels of adoption can be due to high uncertainty about the
   innovation.


Below, we recall the structure of the model and its main hypotheses.
We then devote some time to the exploration of the model and analysis
of the results. Finally, we propose some conclusions.

INDIVIDUAL-BASED MODEL OF INNOVATION DIFFUSION


We first propose a global picture of the model. Then, we give more details
about the state variables and the dynamics.


General Overview 


The principles of the model are the following:


1. Individuals are related to each other through a social network, which
   can be more or less dense. The network is a particular type of small
   world network (Watts and Strogatz 1998, Milgram 1967) with a
   majority of links (95%) corresponding to geographic proximity (under
   a given distance threshold), and others (5%) drawn at random. In
   this article, we do not investigate the precise influence of the network
   structure, but rather rough results related to the density of links. We
   could have used totally random networks and gotten similar results.²

2. We assume that the mass media regularly send messages about the
   innovation, reaching individuals at random. People who receive the
   messages tend to discuss the innovation with their colleagues, and
   we model a propagation of the discussions in the social network. In
   agent-based terminology, this propagation of the discussions
   (explained in greater detail below) rules the scheduling of the
   interactions.

3. The state of an individual contains a social value with an uncertainty,
   an information state (which is a Boolean), and other variables. A
   discussion is modeled as a message exchange only about the social
   value and the information state.

4. The innovation is interpreted within an existing social context, and
   it is given an a priori social value, which is more or less positive.
   For the sake of simplicity, we assume that the initial social value is
   drawn from a normal distribution. The mean and standard deviation
   of this distribution allow us to define innovations with on average
   an a priori high or low social value, more or less homogeneously
   distributed in the population.

5. We postulate that individuals influence each other’s social values
   when they have discussions. The model of social value influence is
   inspired by previous research (Nowak and Vallacher 1998, Axelrod
   1997). However, the originality of this part of the model is the use
   of continuous variables for the opinions and the introduction of an
   uncertainty interval around these opinions. More convinced people
   (more certain), who are more influential, can be modeled. If these
   more convinced individuals are initialized with extreme opinions,
   the model reproduces some aspects of the polarizations observed in
   Moscovici and Doise (1992; see Deffuant et al. [2001], Weisbuch et
   al. [2002], and Deffuant et al. [2002a] for more details).

² We studied more precisely the influence of different types of small world
networks on the relative agreement model in Amblard and Deffuant (2004).

6. To take into account the results of our study on the adoption of
   agri-environmental measures, the individuals having a low a priori
   social opinion about the innovation do not consider the available
   information and therefore do not transmit it. On the other hand,
   individuals having a high a priori social opinion pay attention to the
   information and tend to integrate it (their Boolean value changes to
   true) and to transmit it (it is supposed that this information is not
   strategic). We particularly want to investigate the consequences of
   this attitude toward information on the final adoption level.

7. When interested individuals (having a high social opinion) get
   information, they evaluate the potential individual benefit of adoption.
   Here, to simplify the model, we draw the individual benefit from a
   normal distribution.³ When we vary the mean and the standard
   deviation of this distribution, we represent innovations that are more
   or less beneficial for the individuals, with a more or less homogeneous
   distribution of this benefit in the population.
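
The network described in the first principle above can be sketched as follows. This is only one possible construction under stated assumptions: individuals are placed uniformly at random on a unit square, `radius` plays the role of the distance threshold, and random links are added until they make up about 5% of all links. The function name, the parameter names, and the placement rule are hypothetical; the article does not specify them.

```python
import math
import random


def build_network(n, radius, random_share=0.05, seed=0):
    """Return (positions, links, n_geographic_links) for n individuals."""
    rng = random.Random(seed)
    # Assumption: individuals are scattered uniformly on the unit square.
    pos = [(rng.random(), rng.random()) for _ in range(n)]

    # Geographic links: every pair closer than the distance threshold.
    links = set()
    for a in range(n):
        for b in range(a + 1, n):
            if math.dist(pos[a], pos[b]) < radius:
                links.add((a, b))
    n_geo = len(links)

    # Random links, sized so that they form `random_share` of all links.
    n_random = round(n_geo * random_share / (1.0 - random_share))
    added = 0
    while added < n_random:
        a, b = rng.sample(range(n), 2)
        link = (min(a, b), max(a, b))
        if link not in links:
            links.add(link)
            added += 1
    return pos, links, n_geo
```

The average number of associates per individual is then `2 * len(links) / n`, which is the density measure the experiments vary.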


We now proceed to a more complete presentation of the model.


State Variables of an Individual 


An individual is described by the following state variables, with their
type in parentheses:


1. Social opinion (real number). The social opinion can be negative
   or positive, initially drawn from a normal distribution $N(m_o, \sigma_o)$.

2. Social opinion uncertainty (real number). The term uncertainty is
   used for convenience. In fact, this value represents a mix between
   uncertainty, conviction, and openness to the opinion of others. For
   the sake of simplicity, it is the same value $U$ for every individual,
   except for extremists, who have a much smaller value.

3. Individual benefit (nil or real number). Initialized with the value
   nil, it is drawn from a normal distribution $N(m_b, \sigma_b)$ when the
   individual evaluates his or her potential benefit.

4. Individual benefit uncertainty (nil or real number). Initialized with
   the value nil, it takes the same value for every individual when the
   individual evaluates his or her potential benefit.

³ In the model we applied to agri-environmental measures, the individual
benefit evaluation was performed with an economic model, taking as inputs
the characteristics of the farm and the requirements of the
agri-environmental measure.

5. Global opinion (real number). If the individual benefit is nil, it is
   equal to the social opinion. If not, it is the average of the social
   opinion and the individual benefit.

6. Global opinion uncertainty (real number). If the individual benefit
   uncertainty is nil, it is equal to the social opinion uncertainty. If not,
   it is the average of the social opinion uncertainty and the individual
   benefit uncertainty.

7. Information (yes or no). This variable expresses whether the
   individual is able to evaluate his or her individual benefit and transmit
   the information.

8. Interest (no, maybe, or yes). The value of this variable is based on
   the global opinion and uncertainty.

9. Decision (not concerned, information request, no adoption,
   preadoption, adoption). The value of this variable is based on the
   interest value and the information state.
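
The state variables above, and the two derived quantities (global opinion and global opinion uncertainty), can be sketched in Python as follows. The class and attribute names are illustrative assumptions; `None` stands for the value nil.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Individual:
    social_opinion: float
    social_uncertainty: float
    benefit: Optional[float] = None              # nil until evaluated
    benefit_uncertainty: Optional[float] = None  # nil until evaluated
    informed: bool = False

    def global_opinion(self) -> float:
        # Equal to the social opinion while the benefit is nil,
        # otherwise the average of social opinion and benefit.
        if self.benefit is None:
            return self.social_opinion
        return (self.social_opinion + self.benefit) / 2

    def global_uncertainty(self) -> float:
        # Same rule, applied to the two uncertainties.
        if self.benefit_uncertainty is None:
            return self.social_uncertainty
        return (self.social_uncertainty + self.benefit_uncertainty) / 2
```

The interest and decision variables are derived from these values and are therefore left out of the stored state in this sketch.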


During discussions, individuals exchange messages containing the values
of the following state variables: social opinion, social opinion
uncertainty, and information. The social opinion and its uncertainty are
modified during discussions according to the relative agreement model (see
below). The rules modifying the information state are also specified below.
These modifications can change the interest and the decision states, as
will be explained. The propagation of the discussions (scheduling of
interactions) is also described.


Decision Process 


The decision state is based on the interest and the information states. The
interest state can take three values (no, maybe, yes), based on the global
opinion (see fig. 1):


1. If the global opinion plus the global opinion uncertainty is negative,
   then the interest is no.

2. If the global opinion minus the global opinion uncertainty is positive,
   then the interest is yes.

3. Otherwise, the interest is maybe.
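
The three rules can be written as a small function. This is a sketch under the assumption that "negative" and "positive" are strict inequalities, so that a segment touching zero gives maybe; the function name is illustrative.

```python
def interest_state(global_opinion, global_uncertainty):
    """Compare the segment [opinion - uncertainty, opinion + uncertainty] to zero."""
    if global_opinion + global_uncertainty < 0:
        return "no"      # the whole segment lies below zero
    if global_opinion - global_uncertainty > 0:
        return "yes"     # the whole segment lies above zero
    return "maybe"       # the segment straddles (or touches) zero
```

A larger uncertainty widens the segment and therefore makes the maybe state more likely for the same opinion.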


FIG. 1. Definition of the interest state (the diagram places the global
opinion segment, from min to max, on the opinion axis and shows the
resulting interest state). The global opinion and uncertainty are equal to
the social value and uncertainty if the individual benefit is not computed.
If the individual benefit is computed, the global opinion is the average of
the social opinion and individual benefit, and the global uncertainty is the
average of the corresponding uncertainties. The global opinion plus or minus
the global uncertainty is compared to zero. This comparison defines the
interest state.


The interest states are essential in the dynamics of the information and
decision states. The decision state is ruled by the state diagram of figure
2. The decision process takes into account the interest state and the
information state of the individual. When the individual has no information,
and by hypothesis, a benefit and uncertainty of value nil, there are two
possibilities:


1. If the individual’s interest state is no, then the decision state is not
   concerned. In this case, the individual does not pay attention to the
   information he or she may receive.

2. If the individual’s interest state is maybe or yes, then the decision
   state is information request. In this case, when the individual receives
   a message with information, he or she has the probability w of
   understanding and using it for the evaluation of personal benefit. To
   simplify, the value of the personal benefit is drawn from the normal
   distribution $N(m_b, \sigma_b)$, and the uncertainty is supposed fixed.


As soon as an individual evaluates his or her individual benefit, the
decision state changes, and depending on the interest state of the
individual, two possibilities appear again:


1. If the individual’s interest state is no or maybe, then the decision
   state becomes no adoption. This means that the individual benefit
   was negative enough to drive down the global interest. This state is
   not definitive because messages with very high social opinions may
   change the interest state to yes.


FIG. 2. State diagram of the decision process (states shown: not concerned,
information request, no adoption, preadoption; transitions are labeled with
conditions on the interest state, and preadoption leads to adoption when
the interest is yes for p periods). The squares represent the states and the
comments near the arrows the conditions for the transitions. The interest
state rules the attitude toward the information and also the final decision
once the individual has an opinion about the personal consequences of
adoption.


2. If the individual’s interest state is yes, then the decision state becomes
   preadoption: the individual is ready to adopt, but takes reflection
   time to be sure of the decision. During this period, social influences
   may change his or her interest to no or maybe, and by consequence,
   the decision state to no adoption.


Finally, if the individual remained interested during a given number of
time steps, noted p, he or she adopts, and the decision state becomes
adoption.
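
The decision rules described above can be summarized as a transition function. This is a sketch, not the authors' implementation: the function name and signature are hypothetical, adoption is assumed to be an absorbing state, and the count of interested steps is assumed to start with the step at which preadoption is entered.

```python
def next_decision(decision, interest, evaluated, steps_interested, p=15):
    """Return (new_decision, new_steps_interested).

    decision: current decision state (string)
    interest: "no", "maybe", or "yes"
    evaluated: True once the individual benefit has been evaluated
    steps_interested: consecutive steps spent with interest == "yes"
    """
    if decision == "adoption":
        # Assumption: once adopted, the decision no longer changes.
        return decision, steps_interested
    if not evaluated:
        # Benefit still nil: the interest only rules attention to information.
        if interest == "no":
            return "not concerned", 0
        return "information request", 0
    if interest != "yes":
        # Benefit evaluated but interest is no or maybe.
        return "no adoption", 0
    steps_interested += 1
    if steps_interested >= p:
        return "adoption", steps_interested
    return "preadoption", steps_interested
```

Calling the function once per time step, with the interest state recomputed after each discussion, reproduces the reflection period: any step with interest other than yes resets the counter.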


The “Relative Agreement” Model, Ruling the Influence on the Social 
Values 


During pair interactions, the social opinion and uncertainty are modified
according to the relative agreement (RA) model of social influence
(Deffuant et al. 2002). The main features of this model are the following:


1. Opinions with low uncertainty are more influential.
2. When the overlap between the segments formed by the opinion plus
   and minus the uncertainty is too small, there is no influence.


More precisely, let us consider opinion segments $s_i = [x_i - u_i, x_i + u_i]$
and $s_j = [x_j - u_j, x_j + u_j]$ (see fig. 3). We define the agreement of
individual $i$ with $j$ (it is not symmetric) as the overlap of $s_i$ and
$s_j$, minus the nonoverlapping part. The overlap $h_{ij}$ is given by

$$h_{ij} = \min(x_i + u_i,\, x_j + u_j) - \max(x_i - u_i,\, x_j - u_j). \qquad (1)$$

The nonoverlapping width is

$$2u_i - h_{ij}. \qquad (2)$$

The agreement is the overlap minus the nonoverlap:

$$h_{ij} - (2u_i - h_{ij}) = 2(h_{ij} - u_i). \qquad (3)$$

The relative agreement is the agreement divided by the length of segment
$s_i$:

$$\frac{2(h_{ij} - u_i)}{2u_i} = \frac{h_{ij}}{u_i} - 1. \qquad (4)$$

If $h_{ij} > u_i$, then the modifications of $x_j$ and $u_j$ by the
interaction with $i$ are multiplied by the relative agreement:

$$x_j := x_j + \mu \left(\frac{h_{ij}}{u_i} - 1\right)(x_i - x_j), \qquad (5)$$

$$u_j := u_j + \mu \left(\frac{h_{ij}}{u_i} - 1\right)(u_i - u_j), \qquad (6)$$

where $\mu$ is the rate of the dynamics. If $h_{ij} \le u_i$, there is no
influence of $i$ on $j$.
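
A minimal Python sketch of the relative agreement update may make the role of each term concrete. It follows the overlap, relative agreement, and update rules described above; the function name and the default rate `mu=0.1` are illustrative assumptions.

```python
def relative_agreement_update(x_i, u_i, x_j, u_j, mu=0.1):
    """Influence of individual i on individual j; returns the new (x_j, u_j)."""
    # Overlap between segments [x_i - u_i, x_i + u_i] and [x_j - u_j, x_j + u_j].
    h_ij = min(x_i + u_i, x_j + u_j) - max(x_i - u_i, x_j - u_j)
    if h_ij <= u_i:
        # Overlap too small: i has no influence on j.
        return x_j, u_j
    relative_agreement = h_ij / u_i - 1.0
    new_x_j = x_j + mu * relative_agreement * (x_i - x_j)
    new_u_j = u_j + mu * relative_agreement * (u_i - u_j)
    return new_x_j, new_u_j
```

Because the relative agreement is divided by $u_i$, an individual with a small uncertainty influences others strongly while being hard to influence in return, which is what allows extremists to attract moderates.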

We studied the properties of this model (see Deffuant et al. 2001, Deffuant
2001), in particular, with a population including extremists (Deffuant
et al. 2002a). This study exhibits some conditions under which the
majority of the population, initially moderate, becomes extremist. We also
studied the influence of various social network structures on the
convergence type (Amblard and Deffuant 2004). In the present model, the
initial distribution of opinions is drawn from a normal distribution instead
of a uniform one. This slightly modifies the global behavior of the model,
as shown below.

FIG. 3. Individual $i$ (opinion $x_i$ and uncertainty $u_i$) influences
individual $j$ (opinion $x_j$ and uncertainty $u_j$). $h_{ij}$ is the overlap
between segments $i$ and $j$, and $2u_i - h_{ij}$ is the nonoverlapping part
of segment $i$. On the right, the dotted lines represent the segment before
interaction, the plain lines after interaction.

Propagation of the Discussions


We assume that the media regularly send messages to the population. At
each time step, individuals chosen at random receive the media message,
which triggers a cascade of discussions, propagating in the network in
the following time steps. We assume that the first individuals who receive
the message from the media tend to discuss it with their neighbors, who
themselves then tend to discuss it with their neighbors at the next time
step, and so on. We assume that this tendency to discuss decreases linearly
with the time elapsed after the reception of the message from the media
by the individual who initialized the cascade.

Let $t_0$ be the time step at which an individual $i_0$ receives an
information message from the media. At time step $t_0 + t$, all the
individuals reached by the discussion during the time interval $t$ from
individual $i_0$ ($i_0$ included) send messages to a proportion of their
neighbors, which is given by

$$f(t) = \begin{cases} 1 - \gamma t & \text{while } 1 - \gamma t > 0, \\ 0 & \text{otherwise,} \end{cases} \qquad (7)$$

where $0 < \gamma < 1$ is a parameter of the model, which rules the tendency
of the population to discuss the innovation. When $\gamma$ is close to one,
the discussions do not propagate. On the contrary, when $\gamma$ is close to
zero, the discussions tend to propagate far in the network; the number of
discussions triggered by one individual receiving a message from the
institution first increases and then decreases progressively.
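
The linearly decreasing tendency to discuss can be sketched as follows. The function names are illustrative, and `active_steps` is a helper added here only to show how long a cascade can last for a given value of the propagation parameter.

```python
def discussion_proportion(t, gamma):
    """Proportion of neighbors contacted t steps after the media message."""
    return max(0.0, 1.0 - gamma * t)


def active_steps(gamma):
    """Number of time steps (starting at t = 0) with a positive proportion."""
    t = 0
    while 1.0 - gamma * t > 0:
        t += 1
    return t
```

With `gamma = 0.3`, for instance, the proportion is 1 at the step of reception, then 0.7, 0.4, and 0.1, and the cascade stops at the fifth step.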


EXPLORATION OF THE MODEL 


We explore the model with an experimental design on a subset of
parameters, the others being fixed: we run the model several times for these
values and compare the results. In the first step, we analyze the evolution
of the social opinion distribution. We use this first step to explain the
variations of the final proportion of adopters and informed individuals.


Experimental Design 


This experimental design focuses on:


1. the initial distribution of the social opinion and the distribution of
   personal evaluation;

2. the parameters affecting the evolution of the social opinion and the
   propagation of information (the average size of the individual’s social
   network, the frequency of information diffusion by the institution).

This decision comes from larger investigations on the model, which we
omit here for the sake of simplicity.

Fixed parameters. We fixed the main parameters of the internal
dynamics, because this is not our main focus in the experiment (see table
1). In particular, we kept the population size equal to 1,000 individuals.
Previous investigations showed that the main features of the dynamics
are not sensitive to population size above 50 individuals.

Moreover, we fixed the standard deviation of the individual benefit
evaluation distribution, the uncertainty of this evaluation, and the
uncertainty of the extremists (if any) (see table 2). The values are
discussed below.

Varying values of the experimental design. To simplify the presentation,
we distinguish two parts in the experimental design. The first part
relates to the parameters ruling the type of convergence of the social
opinion distribution (see table 3):


1. The density of the network and the frequency of the media messages
   are aggregated into a single variable with two values (high or low).
   The value high means that at each time step, each individual has a
   probability of 0.4 of receiving an information message from the
   media, and each individual has four associates on average in the social
   network. The value low means that at each time step, each individual
   has a probability of 0.1 of receiving an information message from
   the media, and each individual has one associate on average in the
   social network.

2. The standard deviation $\sigma_o$ of the initial distribution of social
   opinion has two possible values: 0.1 or 0.3.

3. There can be no extremists or 15% of extremists in the population.
   In the latter case, the extremists are the 15% of individuals having
   the highest social opinions.

4. The uncertainty of the moderates (or of all the individuals of the
   population when there are no extremists), denoted $U$, has two possible
   values: 0.05 or 0.3.


The second part of the experimental design relates to the mean value
of the initial distribution of the social opinions ($m_o$) and the mean value
of the distribution from which the evaluation of the individual benefit
($m_b$) is drawn. We want to study more particularly different combinations
of these values (for instance, an innovation having on average an a priori
high social value in the population, but on average a low individual
benefit). The considered combinations are given in table 4.

TABLE 1
FIXED VALUES OF THE DYNAMICS PARAMETERS

Variable                                                         Value
w (the probability of transmitting the knowledge necessary
  to the economic evaluation of the measure)                     0.5
$\gamma$ (discussion propagation)                                0.3
$\mu$ (intensity of the social influence)                        0.1
p (reflection time necessary for the adoption decision)          15 time steps
N (number of individuals in the population)                      1,000


All the combinations between both parts of the experimental design
are considered, which leads to 128 parameter configurations. A run of a
simulation lasts 500 steps, which ensures a stabilization of all the state
variables. We perform 20 runs for each parameter configuration, which
makes 2,560 simulations in total. We then compute the average state of
the population after 500 steps (proportion of adopters, informed
individuals, individuals in different interest states, etc.) over the 20 runs.
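
The size of the experimental design can be checked with a short enumeration. The sketch below lists the four two-level factors of the first part of the design; the number of mean-value couples in the second part (eight) is not stated in the text and is inferred here from 128 / 16. The dictionary keys are illustrative names.

```python
from itertools import product

# First part of the design: four factors with two levels each.
first_part = {
    "media_network": ["low", "high"],
    "sigma_o": [0.1, 0.3],
    "extremists": [0.0, 0.15],
    "U": [0.05, 0.3],
}
first_part_configs = list(product(*first_part.values()))

# Assumption: 8 couples of mean values, inferred from 128 / 16
# (the couples themselves are those of table 4).
n_mean_couples = 8
n_runs = 20

n_configs = len(first_part_configs) * n_mean_couples
n_simulations = n_configs * n_runs
print(len(first_part_configs), n_configs, n_simulations)
```

The three printed counts correspond to the 16 convergence settings, the 128 parameter configurations, and the 2,560 simulations mentioned above.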


Types of Evolution of the Social Opinion Distribution 


In the first stage, we consider the evolution of the social opinion
distribution when the parameters of the first part of the experimental design
vary. The other part of the experimental design has no influence on the
type of evolution of this distribution.

The results yield a typology which is very close to the one proposed in
Deffuant et al. (2002). The main difference is the introduction of two
types: one in which the opinions remain stable over time (because of a
very sparse network), and a mixed case between central and one extreme
convergence.


1. The individuals’ social opinions and uncertainties remain almost
   unchanged during the whole process. We call this case “stable
   convergence” (fig. 4).

2. The social opinion of all the individuals converges toward the initial
   average opinion; the evolution of the population tends toward a
   consensus on the initial average opinion. We call this case “central
   convergence” (fig. 5).

3. A large part of the population sees its social opinion improve
   dramatically and converge to the values of the positive extreme of
   the initial distribution, with a small uncertainty. We call this case
   “extreme convergence” (fig. 6).

4. A mixed case occurs where a part of the population is attracted by
   the extremists, and the other part has a central convergence. We call
   this case “central extreme convergence” (fig. 7).


TABLE 2
FIXED VALUES OF THE EXPERIMENTAL DESIGN RELATED TO THE OPINIONS

Variable                                                     Value
$\sigma_b$ (SD of the individual benefit distribution)       0.1
$u_b$ (uncertainty of the individual benefit evaluation)     0.1
$u_e$ (uncertainty of the extremists’ social opinion)        0.1


TABLE 3
VARIABLES RULING THE TYPE OF CONVERGENCE OF THE SOCIAL OPINION

Network/Media   Low High | Low High | Low High | Low High | Low High | Low High | Low High | Low High
$\sigma_o$         0.1   |   0.3    |   0.1    |   0.3    |   0.1    |   0.3    |   0.1    |   0.3
Extremists (%)           0          |         15          |          0          |         15
$U$                                0.05                   |                    0.3


TABLE 4
AVERAGE OF THE INITIAL DISTRIBUTION OF SOCIAL OPINIONS AND OF THE
DISTRIBUTION FROM WHICH THE INDIVIDUAL BENEFIT IS DRAWN


Table 5 shows the type of social opinion distribution evolution obtained
for the different parameter configurations of the first part of the
experimental design. The stable configuration systematically appears when the
number of discussions is low (media and network). The extremists’ effect
appears of course only when extremists are present, and particularly when
$\sigma_o/U$ is small, which is in agreement with the results obtained with a
uniform initial distribution of social opinion. The central convergence
takes place when there are no extremists and a high number of discussions
(media and network). The central extreme convergence takes place in
intermediate situations. These results can be analyzed more carefully and
explained in more detail, but that is not our objective in this article.

FIG. 4. Trajectories of the individuals’ social opinions (opinion plotted
against time) in the case of “stable convergence” for the parameter values
$U = 0.05$, $\sigma_o = 0.3$, no extremists, media network = low.

FIG. 5. Trajectories of the individuals’ social opinions in the case of
“central convergence” for the parameter values $U = 0.3$, $\sigma_o = 0.3$,
no extremists, media network = high.

FIG. 6. Trajectories of the individuals’ social opinions (opinion plotted
against time) in the case of “shift to positive extremism” for the parameter
values $U = 0.3$, $\sigma_o = 0.1$, extremists = 15%, media network = high.
The extremists (15% of the population) have horizontal trajectories in the
upper part of the figure.

FIG. 7. Trajectories of the individuals’ social opinions in the case of
“central extreme convergence” for the parameter values $U = 0.3$,
$\sigma_o = 0.3$, extremists = 15%, media network = high. The extremists
(15% of the population) have horizontal trajectories in the upper part of
the figure.


Final Proportions of Adopters and Informed Individuals 


We focus now on the final proportions of adopters and of individuals
possessing the information, considering both parts of the experimental
design. The typology introduced in the previous paragraph will help us
to interpret the results. We distinguish different cases of combinations
proposed in the second part of the experimental design.

The mean of the initial social opinion and the individual benefit
evaluation distributions are both negative. In figure 8, we consider the
negative couples of the distribution means $m_o = -0.2$ and $m_b = -0.15$, or
$m_o = -0.15$ and $m_b = -0.2$. The diffusion of information is generally
lower when $m_o = -0.2$, but this does not lead to significant differences
in the proportion of adopters. We note that the proportion of adopters is
most of the time very low (between 0 and 2%), which is not surprising.
However, the model can lead to a non-negligible number of adoptions
(around 10%) when the standard deviation of the initial distribution of
social opinions is large, and a bit more when there is an extremist effect.
The large standard deviation initializes some individuals with an a priori
high social opinion, and the extremist effect slightly increases their
number.

TABLE 5
TYPE OF SOCIAL OPINION DISTRIBUTION EVOLUTION

Stable Central Stable Stable Stable
Central + Extreme
SOCIAL OPINION EVOLUTION
Stable Stable Central Central Stable Central Stable Extreme Stable

FIG. 8. Final adoption (horizontal lines) and information (vertical bars)
proportions when both the initial mean of the social distribution and the
individual benefit are negative. The results are the average over 20
replications.

Figure 9 gives an example of the evolution of the proportion of adopters,
informed individuals, and of the different interest states on a particular
run for a particular parameter configuration with extremists. The number
of uncertain individuals decreases rapidly when the number of informed
individuals increases, because the evaluation of the individual benefit is
negative, which decreases the global opinion when it is evaluated. The
slight increase of interested individuals at the beginning is due to the
extremists (here the evolution of the social opinion distribution is
“central extreme convergence”).

We can enhance this extremist effect by increasing the uncertainty of
the social opinion $U$ to 0.7. In this case, one can reach more than 50%
of adoption at the end of the simulation. One could relate these dynamics
to innovations that were initially very badly perceived except by a
convinced minority and finally had a good diffusion because of the influence
of this minority. One can think of sects or religious movements, for
instance, which were initially persecuted and therefore represented a risk
for the early converted (negative personal evaluation). But these early
converted were convinced enough of the innovation’s social value to
neglect the personal risk.

The mean of the initial social opinion and the personal evaluation
distributions are both positive. Figure 10 shows the final proportion of
adopters and informed individuals for the couples $m_o = 0.15$ and
$m_b = 0.2$, or $m_o = 0.2$ and $m_b = 0.15$. We note that the results are
very similar for both couples. When the uncertainty of the social opinion is
low ($U = 0.05$), the proportion of adoption is quite high (more than 70%).
A small uncertainty associated with a positive value of the opinion leads
easily to an interest state of yes. One can relate these dynamics to
innovations that have a good social image and are also valuable from a
strictly personal point of view. The mobile phone could be put in this
category.

However, when the initial distribution of social opinion has a large
standard deviation (0.3), we notice that the final adoption proportion is
lower, as well as the proportion of informed individuals. As illustrated
by figure 11, which presents the time evolution of the population in a
particular run for this parameter configuration, this is due to the fact that
a proportion of individuals have an initial negative social opinion (they
are not interested) and therefore do not get the information and do not
evaluate their individual benefit. If they did, they would probably become
interested or uncertain, because the individual benefit is positive.

FIG. 9. Example of evolution of the proportion of individuals in the
different interest states, informed and adopters: $m_o = -0.2$ and
$m_b = -0.15$, $U = 0.3$, $\sigma_o = 0.3$, 15% extremists, media network =
high.


With a larger uncertainty ($U = 0.3$), people tend to hesitate (interest
state = maybe), which generates a high level of information, but a lower
adoption proportion. It can be only around 50% when the evolution of the
social opinion is stable. This can be related to innovations that have
globally high social value, but with a high uncertainty: they could also
be very bad. One can think of the introduction of genetically modified
organisms in agriculture, for instance. Although it is agreed that these
innovations can generally bring improvements in the yield and resistance
of the plants, the possible negative consequences on the global ecosystem
are unknown and feared. Note that an extremist effect can lead to about
100% adoption, compared to 70% without the extremist effect (when
$U = 0.3$ and $\sigma_o = 0.1$).

FIG. 10. Final adoption (horizontal lines) and information (vertical bars) proportions when both the initial mean of the social distribution and the
individual benefit are positive. The results are the average over 20 replications.

FIG. 11. Example of evolution of the proportion of individuals in the different interest
states, informed and adopters: m_s = 0.2 and m_i = 0.15, U = 0.05, σ = 0.3, no extremists,
media network = high.

Intermediate cases. We now consider the intermediate couples
m_s = -0.2 and m_i = 0.15, or m_s = 0.15 and m_i = -0.2. These parameter
configurations allow us to underline the asymmetry of the model between
the social and the personal opinions. This asymmetry is due to the
hypothesis that the social opinion governs the initial interest state.
In figure 12, we note that the two couples lead to significantly different adopter
proportions and even more different proportions of informed individuals.
Especially for small values of the social opinion uncertainty (U = 0.05),
the proportion of informed individuals is lower than 40% when m_s =
-0.2 and higher than 70% when m_s = 0.15. The reason is that when the
social opinion is negative and the standard deviation of the social opinion
is small, the information propagates poorly because the interest state tends
to be no. In case 2, for the social opinion uncertainty U = 0.05, the
proportion of individuals having the information is very close to the
adoption proportion, indicating that it is the limiting factor. On the contrary,
in case 1, the information is possessed by almost the whole population
(at least 80%). It is therefore the difference of propagation of information
which explains the difference between these cases. Such a difference of
dynamics can be related to the difficulty of diffusing innovations that
have a low cultural acceptability, even though they can bring a significant
improvement of individual well-being. Think about the condom, for
example, which is not adopted because of its bad cultural image in some
African societies, even though a large part of the population is HIV
positive.

FIG. 12. Final adoption (horizontal lines) and information (vertical bars) proportions when the initial mean of the social distribution and the
individual benefit are of opposite signs. The results are the average over 20 replications.

On the other hand, one can find examples of well-diffused behaviors
which have negative consequences on individual well-being because of
their good social image. For instance, different types of risky behaviors
diffuse among young people in modern societies because they are very
fashionable within the group, or play the role of necessary rituals of
acceptance within the group.

However, the preponderance of the social opinion has limits, and when
the personal evaluation becomes too negative compared to the social
benefit, the diffusion is also very limited. For example, when m_s = 0.15 and
m_i = -0.2, the diffusion culminates around 35%, except when there is
an extremist effect, which brings adoption up to 70%. In the latter case,
as illustrated by a particular simulation in figure 13, the number of
interested individuals increases rapidly while the number of uncertain
individuals decreases because of the extreme convergence of the social
opinion. For many individuals, this high social value compensates for the
negative individual benefit.


DISCUSSION AND CONCLUSION 


We proposed an extension of the threshold model of innovation diffusion.
We maintained two main hypotheses of the threshold models, which are


1. a strong distinction between a social influence and a personal intrinsic
payoff of the adoption,

2. a decision of adoption based on a trade-off between both aspects of
the innovation.


We introduced several supplementary hypotheses which were suggested
by our study on the diffusion of agri-environmental measures. In
particular, we considered a continuous social opinion and an uncertainty (which
can also be interpreted as conviction and broadmindedness), and
particular dynamics of social influences based on this uncertainty. These
dynamics offer the possibility of simulating the diffusion of extreme opinions
in a population. We also introduced a variable of information about the
innovation, which is necessary to make the evaluation of the intrinsic
payoff. Moreover, we supposed that the attitude toward this information
depends on the social opinion.

We are conscious that many of the assumptions behind the model are
highly debatable: the choice of the social dynamics, the dynamics of
discussion, the strong separation between social and personal opinions, the
type of social networks, and more. We made several of these choices with
little or no empirical justification, and sociologists or psychologists
might see them as strong and artificial simplifications. We consider these
proposals as first approximations which can be refined or totally changed
in light of empirical evidence.

FIG. 13. Example of evolution of the proportion of individuals in the different interest
states, informed and adopters: m_s = 0.15 and m_i = -0.2, U = 0.3, σ = 0.1, 15% extremists,
media network = high.

However, one can criticize the model from an opposite point of view:
it may appear too complicated to be efficiently related to quantitative
data. We also partially accept this criticism. Our attempts to relate the
initial distribution of opinions and uncertainties with empirical data from
questionnaires (see Deffuant 2001) were not entirely satisfactory. We also
had some difficulty in attributing concrete values to some parameters of
the dynamics. In this respect, the threshold model is easier to deal with.

Nevertheless, the richness and the interpretability of the different
dynamical behaviors of the model seem to us good arguments in its favor.
The interpretability indicates that the complexity of the model remains
humanly tractable, and that the model helps to bring a new way of
understanding complex social phenomena. Is that not what we ask from
a model?
Let us recall the features identified from the experimental design:


1. The extremist effect can lead to a significant diffusion of innovations
which were initially, on average, badly evaluated socially and for
personal well-being. This extremist effect is also identified for
innovations that have an average positive value.

2. A large uncertainty can lead to innovations that trigger the interest
of many people but diffuse poorly.

3. A low social opinion can block the diffusion of innovations that bring
an individual benefit because it prevents good diffusion of
information.


These features illustrate the richness of the model and the potential to
relate it to specific innovations. In particular, the influence of the social
opinion on the diffusion of information is a feature that explains the
difficulty of diffusing innovations that have a low social image, even
though they bring an objectively significant improvement to individual
well-being. Therefore, we argue that the model presents a new and
meaningful typology of innovation diffusion. We consider these properties of
the model to represent interesting progress compared to the initial
threshold model. Moreover, this typology may offer the possibility of grounding
the model in data coming from particular concrete examples of innovation
diffusion, even if the values of some parameters may have to be tested
by exploration of the parameter space.

The structure of this model could also be used to model more specific
social diffusion phenomena, such as political votes or the purchase of
specific products. In these cases, the role of social opinion messages in the
media (which were not considered in our experimental design) as well as
the social network structure would be interesting subjects of research.


The authors develop an agent-based model of dynamic parties with
social turnout built upon developments in different fields within
social science. This model yields significant turnout, divergent
platforms, and numerous results consistent with the rational calculus of
voting model and the empirical literature on social turnout. In a
simplified version of the model, the authors show how a local
imitation structure inherently yields dynamics that encourage positive
turnout. The model also generates new hypotheses about the
importance of social networks and citizen-party interactions.


Dynamic Parties and Social Turnout

For the past half century, social scientists (political scientists, economists,
sociologists, and psychologists) have been intrigued by two important
empirical regularities: why people vote and why political parties behave
the way they do. The fact that millions of people vote may not seem
puzzling (Berelson, Lazarsfeld, and McPhee 1954; Mackie and Rose 1997).
However, given standard assumptions about rationality, voter turnout
cannot be easily explained. Numerous formal attempts to explain it predict
vanishingly small turnout since the probability of an individual's affecting
the outcome of an election approaches zero in large populations (Palfrey
and Rosenthal 1985; Aldrich 1993; Myerson 1998). In fact, people going
to the polls have a much higher chance of getting into a car accident. This
has led many scholars to infer that rational explanations of turnout must
rely on an additional benefit derived from fulfilling a sense of duty or
having a general taste for voting (Riker and Ordeshook 1968).

The second interesting empirical regularity is the way political parties
choose their electoral platforms: candidates offer voters policies that
diverge significantly from the median voter and remain relatively stable
over time (Peltzman 1984; Grofman, Griffin, and Glazer 1990; Poole and
Rosenthal 1984). Again, given standard assumptions about rationality,
this empirical regularity is not obvious since early models of party
platforms predicted convergence to the median voter (Downs 1957; Davis,
Hinich, and Ordeshook 1970) or divergence across the entire policy space
with any platform possible (McKelvey 1976; Plott 1967).² Subsequently,
scholars explored the impact of uncertainty on policy-motivated
parties (Wittman 1977; Calvert 1985; Roemer 2001). These
models do yield equilibria with divergent policies, but analysis in a closed
form is rather complex. It quickly becomes intractable under all but the
most basic assumptions (Roemer 2001).

Because of the complexity involved in modeling both parties and voters,
past efforts have not combined them (Osborne 1995). Models of voter
turnout have usually relied on assumptions of fixed party platforms, while
models of platform choice have assumed a fixed level of voter turnout
(usually 100%). The interdependence between people and politicians also
has a dynamic character that is missing from many models because they
consider a single election in isolation. Most elections are, in fact, part of
a longer process of party competition and take place in a context of
information about previous elections.

Economists and political scientists have also frequently abstracted away
from elements that sociologists and psychologists believe to be critical for
determining electoral behavior. For example, many models of elections
have avoided situating voters in social networks, or social context in
general. Voters are often assumed to exist independently of one another
in spite of a growing body of sociological evidence suggesting that how
they are situated in relation to one another plays a critical role in the
decision to vote (Lazarsfeld, Berelson, and Gaudet 1948; Berelson et al.
1954; Campbell, Gurin, and Miller 1954; Glaser 1959; Huckfeldt and
Sprague 1995; Straits 1990; Knack 1992; Kenny 1992; Beck et al. 2002).


2 The idea of platform convergence on a single issue space was borrowed from
economics. Two shops fighting for customers on a single street will choose their locations
in the middle of the street in order to minimize the average distance to the shop for
all potential customers (Hotelling 1929).

Most models of elections also make typical cognitive assumptions about
information and individual rationality, in spite of the evidence from
psychology that both may be severely limited (e.g., Simon 1982; Quattrone
and Tversky 1988). Instead, people might use “fast and frugal heuristics”
to deal with informational limitations and strategic complexities but still
achieve relatively good results (cf. Gigerenzer et al. 1999; Cosmides and
Tooby 1996; Lupia and McCubbins 1998).

The complexity of including all these features in a formal analytical
model would overwhelm the model. A closed-form solution would
probably not be tractable. However, leaving some or all of these features out
may yield incomplete inferences about voter and party behavior.
Therefore, we develop a formal model using an alternative methodology:
agent-based models (ABMs). Like analytical models, ABMs are built on formal
assumptions about agents (players in games) and how they interact.
Similar to the standard analytical models, the assumptions are clearly defined,
the results are stated in precise terms, and they are, typically, easy to
replicate (Gilbert and Troitzsch 1999). Unlike most analytical models,
however, ABMs are usually analyzed computationally, which means they
are less elegant but also less susceptible to problems of tractability.
Computational models generate data to show the relationships between
variables of interest. Moreover, ABMs may make it easier to analyze paths
to equilibrium, to recognize emergent patterns of interaction, and to
generate quickly models like this one where interaction is especially
complicated (Johnson 1998). In other words, computational modeling provides
insight not only into the outcome of a process but also into the
dynamics of the process itself, without sacrificing the rigor of formal modeling
(Nelson and Winter 2002).

In this article, we describe and analyze an ABM of repeated elections
in which voters and parties behave simultaneously. We place voters in a
social context and let them interact with one another when choosing
whether or not to vote. We also let parties choose the platforms they offer,
and these choices may change from election to election depending on
feedback from the electorate. This allows us to explore the endogenous
interaction of dynamic platforms and costly turnout. In the process, we
relax standard assumptions of unlimited information-processing capacities
and individual hyperrationality. Citizens are limited to information they
can get from their immediate neighbors. They are boundedly rational
agents who use simple heuristics to make the turnout decision. Parties
are assumed to be more sophisticated, optimizing their choices given their
beliefs about the expected behavior of voters and their opponents.
However, they form these beliefs based on limited information: they only
know the results of past elections.

The computational model that we analyze generates a number of results
that contribute to the interdisciplinary literature on voting. First, the
average level of aggregate turnout is empirically realistic and varies from
election to election within a stable range. Second, we show that the model
is consistent with much of the empirical evidence generated to test the
rational calculus of voting. Turnout increases as the cost of voting
decreases, the stakes of the election increase, and the margin of victory
declines. Thus, even though citizens have very limited information and
use a very simple learning rule, they are able to respond as though they
were prospectively rational to variation in the incentive to vote. Third,
the model is consistent with empirical results from the literature on the
social context of voting. In particular, turnout correlates highly between
neighbors, and citizens who discuss politics with more neighbors are more
likely to vote. Fourth, the model also generates a surprising result: when
citizens are situated near people with similar preferences, they are less
likely to vote. In short, segregation depresses turnout. Finally, we explain
why a local imitation structure inherently yields dynamics that encourage
positive turnout.

The model also generates a number of results that contribute to the
literature on party behavior. First, consistent with Wittman equilibrium
under uncertainty, policy-motivated parties offer divergent platforms. In
this setting, citizens, who are free to vote or abstain, serve as a source of
uncertainty since the location of the median voter is changing all the time.
Second, parties adjust their platforms in direct response to the vote share
in the previous election. Both parties move in the direction of the previous
winner and in proportion to the previous margin of victory. Third, parties
are drawn not only to the median voter, but also to the median citizen
since she represents the median of the pool of potential future voters.
Finally, the model generates another surprising result: electorates with
higher local correlation of preferences lead to a greater divergence of party
platforms. This suggests that parties polarize as neighborhoods become
more segregated.

In the following section, we describe the general structure and most
important elements of the ABM of elections: how voters make their
decisions and how parties choose their platforms. Then, we proceed with
analysis of the main results of our model, most notably, why people vote
despite the cost of voting, and what electoral aspects influence party
platforms. In the final section, we summarize our findings and discuss the
application of computational models of elections in future research.


THE MODEL


In this section, we describe a simplified version of our computational
model.³ As in the standard political science model of elections, we assume
that each citizen in a population has some preferred policy point on a
one-dimensional left-right scale, which one can think of as a
liberal-conservative issue space. Two parties compete in elections, and these parties
have fixed left and right preferences. The parties choose electoral
platforms (see below) and each citizen chooses to vote or to abstain. If a
citizen turns out, she chooses the party offering the platform closest to
her own preference. Votes for the left and right are counted and the
election winner is determined by majority rule. After each election, a
citizen's utility is simply the negative squared distance between her
preferred policy and the platform implemented by the winning party, minus
the cost of voting.⁴ Parties are assumed to be policy motivated: they have
the same preferences and utility over the policy space as voters do (a
party prefers to win the election with a policy closer to its ideal point).
Parties only know their own preference point and the results of past
elections. They do not know the distribution of voter preferences and,
therefore, they do not know the exact location of the median voter.
Moreover, some of the former voters may abstain and some of the former
abstainers may vote, meaning the location of the median voter may change
from election to election (Brody and Page 1973). To deal with this
uncertainty, we assume that parties use previous election results to learn
about the voter distribution. First, they use the results of the past election
to estimate the location of the median voter.⁵ For example, if the left party
wins in a landslide, both parties can infer that the median voter was
located closer to the left platform than to the right platform. Second, they
use Bayesian inference to update their beliefs about the expected median
voter in the coming election.⁶ Given these beliefs about the electorate, the


3 Code for the R implementation can be found at http://jhfowler.ucdavis.edu.


4 $u_i = -(x - p_i)^2 - c$, where $u_i$ is the utility of voter i, x is the platform of the winning
party, $p_i$ is the preference of voter i, and c > 0 is the cost of voting. If a citizen abstains,
c = 0.

5 The location of the median voter m is the solution to the equation

$$S = \int_{-\infty}^{(x_L + x_R)/2} f(v \mid m)\, dv,$$

where S is the vote share for the left party, $x_L$ and $x_R$ are the party platforms, and
f(·) is the voter distribution (which we assume to be normal with variance 1).

6 We assume that parties model the location of the median voter as if it were drawn
from a normal distribution with unknown mean and variance. It is well known that
under these conditions the expected median voter will be the sample mean of all

parties choose platforms by mutually optimizing their expected payoffs.⁷

Unlike parties, citizens employ a less sophisticated decision-making
mechanism. We model citizens as boundedly rational agents with access
to limited information. In the model, they only know the utility and
turnout behavior of their immediate neighbors, meaning they do not know
the true preferences of any other citizens or parties. One might argue that
this assumption is unnecessarily naive: surely people think for
themselves! However, we know from much of the empirical literature on
contextual effects that local information has a powerful influence on
individual voter behavior (Beck et al. 2002; Fotos and Franklin 2002).
Imitation has been shown to be an extremely cost-effective strategy in
complex environments, even if it does not necessarily lead to the best
possible outcome (e.g., Boyd and Richerson 1985).

To model local interaction, we endow citizens with preferences and
place them randomly on a grid.⁸ We then allow them to have political
discussions with other people in their neighborhood.⁹ Given the
constraints on information and the enormous complexity of maximizing utility
over some set of future elections, citizens adopt the most successful
strategies from past elections. We assume that there is an information flow
among immediate neighbors with respect to the past election, in particular,
whether or not they voted and how satisfied they were with the results.
Since voters can learn about the turnout behavior and relative satisfaction
of their neighbors, they can use this information to decide whether or not
to vote in the next election. Specifically, they divide people in their
neighborhood between voters and abstainers, decide which type is more
satisfied, and then imitate the behavior of the more satisfied group.¹⁰


previous observations of the location of the median voter, and the variance in the
expected median voter will be the sample variance.


7 Following Wittman (1977), we assume that the expected payoff of each party is the
probability of winning times the winning payoff plus the probability of losing times
the losing payoff. Parties choose a set of equilibrium policies in which neither party
can achieve a higher expected payoff by changing its platform. For a detailed
description of the dynamic model of policy-motivated candidates under uncertainty and
Wittman political equilibrium, see Smirnov and Fowler (2003).
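
The party-side calculations in footnotes 5 to 7 can be sketched as follows. This is an illustrative Python sketch and not the authors' R code: the function names are hypothetical, voters are assumed normal with variance 1 around the median voter, the left platform is assumed to lie left of the right platform, and the left party is assumed to win whenever the median voter lies left of the platform midpoint.

```python
from statistics import NormalDist, mean, variance


def estimate_median_voter(left_share, x_left, x_right):
    """Invert S = Phi((x_L + x_R)/2 - m) for m (footnote 5).

    Assumes unit-variance normal voters; left_share must lie strictly
    between 0 and 1, otherwise inv_cdf raises an error.
    """
    midpoint = (x_left + x_right) / 2.0
    return midpoint - NormalDist().inv_cdf(left_share)


def update_beliefs(past_medians):
    """Belief about the next median voter (footnote 6): sample mean and
    sample variance of past estimates. Needs at least two observations."""
    return mean(past_medians), variance(past_medians)


def left_win_probability(x_left, x_right, belief_mean, belief_var):
    """Probability that the median voter falls left of the platform
    midpoint under a normal belief (an assumption of this sketch)."""
    midpoint = (x_left + x_right) / 2.0
    return NormalDist(belief_mean, belief_var ** 0.5).cdf(midpoint)


def expected_payoff(ideal, own_platform, rival_platform, win_prob):
    """Expected payoff as described in footnote 7, using the same
    quadratic policy loss that the text gives for voters."""
    win_payoff = -(own_platform - ideal) ** 2
    lose_payoff = -(rival_platform - ideal) ** 2
    return win_prob * win_payoff + (1.0 - win_prob) * lose_payoff


if __name__ == "__main__":
    # A left landslide (60% of the vote) implies a median voter left of the midpoint.
    print(estimate_median_voter(0.60, -0.5, 0.5))
```

Finding the equilibrium pair of platforms would additionally require each party to maximize `expected_payoff` against the other's choice; that search is left out here.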


8 We assume voter preferences are independent and drawn from a standard normal
distribution. Later in the article we will relax the independence assumption by
assuming preferences are correlated between neighbors.


9 We assume a Moore neighborhood structure, which means individuals typically have
eight neighbors (top, bottom, left, right, top left, top right, bottom left, and bottom
right). We also assume the grid is bounded, so individuals on the edges have fewer
neighbors (e.g., an individual on the left edge has five neighbors: top, bottom, right,
top right, and bottom right).
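
The bounded Moore neighborhood of footnote 9 can be illustrated with a short Python sketch. The function name and the (row, column) indexing are assumptions made for illustration, and the 32 by 32 grid in the usage lines is only one layout that would hold the 1,024 citizens mentioned in the figure 1 caption.

```python
def moore_neighbors(row, col, n_rows, n_cols):
    """Return the cells adjacent to (row, col), including diagonals.

    The grid is bounded (no wrap-around), so cells on an edge or in a
    corner simply have fewer neighbors.
    """
    neighbors = []
    for d_row in (-1, 0, 1):
        for d_col in (-1, 0, 1):
            if d_row == 0 and d_col == 0:
                continue  # a cell is not its own neighbor
            r, c = row + d_row, col + d_col
            if 0 <= r < n_rows and 0 <= c < n_cols:
                neighbors.append((r, c))
    return neighbors


if __name__ == "__main__":
    print(len(moore_neighbors(5, 5, 32, 32)))  # interior cell
    print(len(moore_neighbors(5, 0, 32, 32)))  # left-edge cell
    print(len(moore_neighbors(0, 0, 32, 32)))  # corner cell
```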


10 There are several learning algorithms that we could choose to model this behavior,
so we have deliberately chosen a simple one. Citizens have discussions with each of
their neighbors and learn how satisfied they were (i.e., their utility) with the results of


RESULTS: GENERAL DYNAMICS


To analyze computational results from the model, we employ three
strategies. First, we develop a graphical user interface (GUI) for the model so
we can watch what happens to voter utilities, turnout, platforms, and
other variables of interest. Computational modeling is unique because it
allows us to inspect visually what is happening to our model as it
progresses. This sometimes leads to hypotheses about the dynamic processes
that might not otherwise have been obvious using different methodologies
(Gilbert and Troitzsch 1999). Second, we produce graphs of several runs
of consecutive elections. These graphs are snapshots of the dynamic
behavior of one or two variables from the model and they are useful for
characterizing typical boundaries and changes in the values for a given
set of model assumptions. Third, we conduct multiple runs and collect
data at the end of each run. This allows us to see how changes in
assumptions affect how the model behaves.

In figure 1, we present some results from a typical run of 100 elections.
The lower-left graph shows that turnout varies between 35% and 55%.
When we let the simulation run for thousands of elections, turnout never
jumps out of this range: turnout seems to be significant and stable even
when it is costly. The upper-right graph shows how the model generates
instability in the location of the median voter. Even though the preference
of the median citizen remains fixed for a given run (represented by the
straight horizontal line in the graph), the preference of the median voter
depends on who decides to vote and changes from election to election.
Notice especially that the median voter can remain to the left or right of
the median citizen for several elections, indicating a period when one
party's supporters are more active than another's.

The upper-left graph shows how party platforms change over time to
adapt to these circumstances. After a brief convergence from initial
conditions and a period of instability, the platforms tend to oscillate in a
stable range that remains significantly far from the center. This oscillation
seems to vary with the location of the median voter as parties attempt
to adjust their platforms in the median voter's direction. Constant
adjustment by the parties also generates variation in the margin of victory
in the lower-right graph as parties alternate winning and losing elections.


the previous election. Each citizen then estimates the average satisfaction with voting
s^vote by summing the satisfaction of all voters in the neighborhood (including themselves,
if applicable) and dividing by the number of voters. Similarly, they find the average
satisfaction with abstaining s^abstain by summing the satisfaction of all abstainers in the
neighborhood (including themselves, if applicable) and dividing by the number of
abstainers. If the number of voters or abstainers in the neighborhood is zero, then the
individual repeats her action from the previous election.
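
The imitation rule in footnote 10, together with the utility in footnote 4, can be sketched in Python as below. The function names and the (voted, utility) record format are hypothetical, and the handling of an exact tie between the two averages is an assumption of this sketch, since the text does not say what happens in that case.

```python
def utility(preference, winning_platform, voted, cost):
    """Negative squared distance to the winning platform, minus the
    cost of voting if the citizen voted (footnote 4)."""
    return -(winning_platform - preference) ** 2 - (cost if voted else 0.0)


def next_turnout(own_voted, own_utility, neighbor_records):
    """Decide whether a citizen votes in the next election.

    neighbor_records is a list of (voted, utility) pairs, one per
    neighbor. The citizen's own record is included in the averages,
    as the footnote specifies.
    """
    records = [(own_voted, own_utility)] + list(neighbor_records)
    voter_utils = [u for voted, u in records if voted]
    abstainer_utils = [u for voted, u in records if not voted]

    # With no voters or no abstainers to compare, repeat the last action.
    if not voter_utils or not abstainer_utils:
        return own_voted

    mean_vote = sum(voter_utils) / len(voter_utils)
    mean_abstain = sum(abstainer_utils) / len(abstainer_utils)

    # Assumption: on an exact tie the citizen also repeats her last action.
    if mean_vote == mean_abstain:
        return own_voted
    return mean_vote > mean_abstain


if __name__ == "__main__":
    # An abstainer with one satisfied voting neighbor switches to voting.
    print(next_turnout(False, -1.0, [(True, -0.1), (False, -2.0)]))
```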
FIG. 1. Results of a single run of 100 elections. For this run, we test a population of 1,024 voters with independent preferences drawn from a
standard normal distribution, party preferences at -1 and 1, cost of voting of 0.1, and initial probability of turnout of 0.5. In the upper-left graph,
the solid line is the left party and the dotted line is the right party. In the upper-right graph, the solid line is the median voter and the dotted line is the median
citizen. In the lower-right graph, the dotted line marks the location of a tie (right vote share = 0.50).


WHY SO MUCH TURNOUT?


The main source of turnout in our model is the result of imitation in a
social context. We assume that citizens are boundedly rational, acquiring
information only from their neighbors in order to decide whether to vote
in the next election. In the extreme cases in which everyone votes or
everyone abstains, the citizen simply repeats her prior action. In other
cases, we can derive the expected probability that the voters in a randomly
sampled neighborhood will happen to do better than the abstainers
because of the random location of their preferences.¹¹ Figure 2 shows the
probability that a randomly sampled citizen will vote given the number
of her neighbors who voted in the previous election and the cost of voting.
From top to bottom, each curve in figure 2 represents a higher cost of
voting. Note that increasing the cost of voting decreases the probability
of voting for all neighborhood types. This is because the cost of voting
directly decreases the average satisfaction of voters in all neighborhoods.
This effect is intuitive and conforms to other theoretical and empirical
models. Note also that when voting is costless (the top curve), the
probability of voting is about 0.5 when about half the neighborhood votes
and half abstains (between four and five voters in a nine-person
neighborhood). The expected utility to voters and nonvoters is the same if there
is no cost to voting, so the odds that one group does better than another
should be the same for both at probability 0.5. However, this is only true
when the number of voters and abstainers is about the same. The
downward slope in the curves in figure 2 indicates that citizens with fewer
voters in their neighborhood are more likely to vote and citizens with
more voters in their neighborhood are less likely to vote. This suggests a
negative reinforcement effect that encourages turnout. As the probability
of turnout declines, so does the expected number of voters in a given
neighborhood, but the probability of turnout for these neighborhoods
increases as the number of voters in the neighborhood decreases.
Negative reinforcement may seem counterintuitive, but consider the 
fact that each citizen 1s essentially sampling from the population When 
one sampled group 1s substantially larger, ıt ıs more likely to yield an 
average satisfaction level that 1s close to the population average The 
smaller group 1s privileged because there 1s a better chance that it will 


“ It ıs important to emphasize here that while citizen decisions are deterministic in 
our model, the distribution of preferences 1s stochastic Thus utility itself 15 a random 
variable a citizen ın an z-person neighborhood with v voters will vote in the next 
election with probability 
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Fic 2 —Theoretıcal impact of cost and neighborhood type on turnout @ = number of 
abstainers ın a neighborhood ın previous election, v = number of voters ın previous election 
(eg,v —1,a = 81s a neighborhood with one voter and eight abstaıners in the previous 
election Citizens in a neighborhood like this have a 65% chance of voting if the cost of 
voting 1s zero, 58% 1f the cost 1s 02, 49% if the cost 1s 04, and 38% 1f the cost 1s 0 6) 
Probabilities are based on assumption that neighbors have preferences that are randomly 
drawn from a standard normal distribution 


happen to have preferences very close to the winning platform. For
example, suppose that half the citizens in a neighborhood vote in the first
election and that voting is costly. After that, citizens decide whether or
not to vote by comparing average utilities of voters and abstainers. It is
likely that eventually the number of voters in the neighborhood will
decrease to one or two, since the cost of voting is positive. However, if
one of the few remaining voters happens to have a preference that is
relatively close to the platform of the winning party, the voter will be
more satisfied than the abstainers. Since the number of voters in the
neighborhood is small, her satisfaction will dominate the average
satisfaction of turnout. As a result, her neighbors will imitate her turnout
behavior. Of course, the local surge of voting will be quickly suppressed
by the cost of voting, and thus a local turnout-abstention cycle occurs.
The global dynamic is a combination of all the overlapping local
neighborhoods, all of which experience periods of turnout and abstention at
different moments of time. Hence, a local imitation structure inherently
yields dynamics that encourage turnout.¹²
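The local imitation rule described in this passage (compare the average satisfaction of voters and abstainers in one's neighborhood, then copy the better-off group) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names are hypothetical, the utility form (negative distance to the winning platform, minus the cost of voting when paid) is assumed rather than taken from the model specification, and the handling of neighborhoods with no voters or no abstainers is a guess.

```python
from statistics import mean


def satisfaction(preference, winning_platform, voted, cost):
    """Assumed utility: closeness to the winning platform, minus the
    cost of voting if the citizen turned out."""
    return -abs(preference - winning_platform) - (cost if voted else 0.0)


def next_turnout_choice(voter_satisfaction, abstainer_satisfaction):
    """Return True (vote next time) if voters in the neighborhood were,
    on average, more satisfied than abstainers in the last election."""
    if not voter_satisfaction:       # assumption: nobody to imitate, so abstain
        return False
    if not abstainer_satisfaction:   # assumption: everybody voted, so keep voting
        return True
    return mean(voter_satisfaction) > mean(abstainer_satisfaction)


# One lone voter who happens to sit close to the winning platform,
# surrounded by abstainers with scattered preferences.
winner, cost = 0.1, 0.1
lone_voter = [satisfaction(0.15, winner, True, cost)]
abstainers = [satisfaction(p, winner, False, cost) for p in (-1.2, 0.9, 1.5)]
neighbors_will_vote = next_turnout_choice(lone_voter, abstainers)
```

In this example the single voter's satisfaction is the whole "voter average", so her good luck is enough to pull the neighborhood back toward turnout, which is the negative feedback the text describes.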


12 The negative feedback mechanism not only leads to turnout when it is costly but
also to abstention when it is not costly. In fact, even if we make the cost of voting
negative, turnout will still be significantly less than 100% for this reason.


THE RATIONAL CALCULUS OF VOTING


The rational calculus of voting model assumes that voters think
prospectively about the impact of their actions on their own utility. Advocates
of this model cite several empirical regularities predicted by the model as
evidence that these assumptions are correct. In contrast, our model
assumes that voters adapt to past outcomes. In figure 3, we see that our
model generates the same empirical regularities. For example, turnout is
sensitive to the cost of voting. An increase from nothing to 0.1 depresses
turnout by about 4%.¹³ The tendency of voters to respond to higher costs
with lower turnout is consistent with a broad empirical literature on the
subject. For example, restrictive registration laws clearly discourage
voting (Rosenstone and Wolfinger 1978; Squire, Wolfinger, and Glass 1987;
Nagler 1991; Rhine 1995; Knack 1997, 2001; Franklin and Grier 1997;
Fenster 1994; Highton 1997; Knack and White 2000; Highton 2000;
Huang and Shields 2000), while liberal absentee ballot laws and all-mail
elections encourage it (Oliver 1996; Karp and Banducci 2000; Southwell
and Burchett 2000).

The rational calculus of voting literature also posits that voters should
be influenced by the expected benefits from voting, expressed as a function
of the distance between the parties and the probability of influencing the
outcome of the election. Our model produces both of these relationships.
In the middle graph of figure 3, turnout increases with the distance
between party platforms, consistent with empirical work that suggests that
turnout is somewhat higher in elections with higher stakes (Wolfinger and
Rosenstone 1980; Boyd 1989; Hansen, Palfrey, and Rosenthal 1987;
Jackson 2000) and a larger distance between the parties (Kaempfer and
Lowenberg 1993).

In the lower graph, turnout increases with the closeness of the
election: participation decreases as the margin of victory by one of the
parties increases. This effect is consistent with an empirical literature that
has tried to use the closeness of an election as a proxy for how voters
perceive the likelihood of affecting the outcome. Though the relationship
has been questioned by some (Key 1949; Matsusaka 1993; Kirchgassner
and Himmern 1997; Kunce 2001), the weight of the evidence seems to
point to a small but significant correlation between closeness and turnout
(Cox and Munger 1989; Berch 1993; Jackson 1983; Hanks and Grofman


13 This may seem like a trivially small cost of voting, but consider the fact that the
mean distance between the left and right party platforms in our sample is about 0.2.
If voters must bear a cost of 0.1 in order to vote, they are paying one-half of the total
benefit they would receive if they could choose the election winner. For most of the
formal models cited above, the highest cost-benefit ratios that would yield positive
turnout are typically several orders of magnitude smaller than this.

Fig. 3. Determinants of voter behavior. Each graph based on 1,000 simulations of a
population of 1,024 voters with independent preferences drawn from a standard normal
distribution, party preferences at -1 and 1, and initial probability of turnout of 0.5.
Upper-right and lower graphs assume a cost of voting of 0.1. Cost of turnout was varied from 0
to 1 in the upper-left graph. Means calculated using full-bandwidth LOWESS.


American Journal of Sociology 


1998; Grofman, Collet, and Griffin 1998; Nalebuff and Shachar 1999;
Alvarez and Nagler 2000).

The fact that the model produces results consistent with the rational
calculus of voting suggests that the adaptation model for citizens is
sufficiently sophisticated that they are able to learn to vote more often when
it would make them better off, that is, when costs are low, stakes are
high, and elections are close. However, turnout is still quite high relative
to a model in which citizens are perfectly informed and strictly utility
maximizing. To see if this discrepancy is associated with limited
information, we alter our model slightly by endowing citizens with memory.

Memory permits citizens to combine information from previous
elections with new information about the merits of voting and not voting.
Specifically, a memory parameter governs how new information is
weighed relative to previous information.¹⁴ If this parameter is set to zero,
then citizens only remember the results of the past election. As the
parameter increases toward one, they remember more and more of the past,
and as a consequence, the relevance of the current election decreases. The
graph in figure 4 shows the effect of increasing citizen memory. As voters
acquire more information about the relative merits of voting and
abstaining, they choose to abstain in greater numbers.
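One simple way to read the memory parameter is as an exponential weight on the previous estimate. The sketch below assumes that form; the update formula, the function names, and the starting estimate of zero are illustrative assumptions, since the exact expression is not reproduced in the text here. It does match the verbal description: with memory of zero only the most recent election counts, and as memory approaches one the current election matters less and less.

```python
def update_estimate(previous_estimate, current_average, memory):
    """Blend the remembered estimate of average satisfaction with the
    average observed in the most recent election.

    Assumed form: memory * old + (1 - memory) * new, with 0 <= memory < 1.
    """
    if not 0.0 <= memory < 1.0:
        raise ValueError("memory must lie in [0, 1)")
    return memory * previous_estimate + (1.0 - memory) * current_average


def run_memory(observed_averages, memory, initial_estimate=0.0):
    """Apply the update over a sequence of elections and return the
    final estimate (initial_estimate is a hypothetical starting value)."""
    estimate = initial_estimate
    for average in observed_averages:
        estimate = update_estimate(estimate, average, memory)
    return estimate


# Two elections with different observed average satisfaction for voting.
observed = [-0.2, -0.6]
no_memory = run_memory(observed, memory=0.0)    # tracks the last election only
half_memory = run_memory(observed, memory=0.5)  # blends both elections
```

A citizen would keep one such estimate for voting and one for abstaining and compare the two; longer memory averages over more observations, which is why it sharpens the estimate of the relative costs and benefits of participation.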

The negative relationship between memory and turnout suggests that
limited information about the costs and benefits of voting plays an
important role in supporting high levels of participation. To make sense of
this, think of the extreme case. Without memory, the only information
citizens have is the relative satisfaction levels of their neighbors and
themselves for the most recent election. With memory, citizens have access to
all this information, plus some of the information they acquired in
previous elections. As memories lengthen, the number of individual
satisfaction levels that go into the average satisfaction level increases,
improving the estimate of the relative costs and benefits of participation.


SOCIAL NETWORKS AND TURNOUT 


Our model produces results that are consistent with findings related to
social networks. At the level of the individual voter, we find correlation
in vote strategies between neighbors. For the baseline simulation, this
correlation is about ρ = 0.29, and it does not change much when we try
different combinations of parameters. This result conforms to the finding
that turnout is correlated between friends, family, and co-workers
(Lazarsfeld et al. 1948; Berelson et al. 1954; Campbell et al. 1954; Glaser
1959; Huckfeldt and Sprague 1995; Straits 1990; Knack 1992; Kenny
1992). One might argue that this is a trivial result. After all, the model
assumes that voters imitate their neighbors, so we should expect to find
some correlation in turnout behavior. However, we emphasize that this
is the only theoretical model we are aware of that generates correlated
turnout. What it suggests is that models that do not embed their citizens
in a social network context may be omitting an important feature of the
real world that is relevant to turnout behavior.

Fig. 4. Effect of memory on turnout. Based on 1,000 simulations of a population of
1,024 voters with independent preferences drawn from a standard normal distribution, party
preferences at -1 and 1, and initial probability of turnout of 0.5. Voter memory was varied
from 0 (least weight on past information) to 0.9 (most weight on past information). Means
calculated using full-bandwidth LOWESS.

14 Let M be a memory parameter. As above, citizens find the average satisfaction level
of voting and abstaining for the current election, but they now weight the new
information with previous estimates of the average satisfaction levels for voting and
abstaining

The social network context we have supposed so far is artificial in a
very important way. We assume that individual preferences are not
correlated. The probability of a liberal speaking to another liberal in our
model is the same as the probability of a liberal speaking to a conservative.
However, a consistent finding in the social voting literature is that people
tend to segregate themselves into like-minded groups. As a result, most
social ties are between people who share the same interests. Even when
people with ideological or class-based interests are not surrounded by
like-minded individuals in their physical neighborhoods and workplaces,
they tend to withdraw and form relationships outside those environments
(Huckfeldt, Johnson, and Sprague 2004; Huckfeldt and Sprague 1987;
Noelle-Neumann 1984; Gans 1967; Berger 1960). Thus, preferences
between acquaintances tend to be highly correlated. For example, in the
Indianapolis-St. Louis Election Study (Huckfeldt and Sprague 2000), the
correlation in liberal-conservative ideology is ρ = 0.66, while the
correlation in party preference is ρ = 0.54.

What effect does the concentration of shared interests have on our
model? Figure 5 shows that preference correlation has a dramatically
negative effect on turnout. When a citizen has discussions with a diverse
group, it is more difficult to discern the costs and benefits of voting.
However, when all a citizen’s neighbors are just like her, she is more
likely to free ride. To see why, suppose an extreme case in which everyone
in a citizen’s neighborhood has the same preference. When comparing
the average satisfaction level of voters and abstainers, the benefits will
be exactly the same for everyone. The only thing that differentiates the
voters from the abstainers is the cost of voting. Thus, it would be easy
to figure out that free riding makes sense. Now suppose the opposite case
in which neighbors have heterogeneous preferences. Even though all
voters pay a cost of voting, some voters will be very satisfied because they
happen to be located close to the winning candidate. Conversely, even
though abstainers do not pay a cost of voting, some will be very dissatisfied
because they have preferences that are far away from the winning
candidate. Thus, as preference correlation decreases, the relationship between
satisfaction level and turnout behavior breaks down, and it becomes more
difficult to discern the advantage of free riding. In short, social segregation
hurts participation.
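The two extreme cases in this argument can be checked with a small worked example. The sketch below is illustrative only: the function name is hypothetical, and the utility form (negative distance to the winning platform, minus the cost of voting when paid) is an assumption rather than the model's stated specification. It computes the gap between the average satisfaction of voters and of abstainers in one neighborhood.

```python
from statistics import mean


def satisfaction_gap(preferences, voted, winning_platform, cost):
    """Average satisfaction of voters minus that of abstainers.

    Assumed utility: -|preference - winning_platform|, less `cost`
    for those who voted.
    """
    def utility(preference, is_voter):
        return -abs(preference - winning_platform) - (cost if is_voter else 0.0)

    voters = [utility(p, True) for p, v in zip(preferences, voted) if v]
    abstainers = [utility(p, False) for p, v in zip(preferences, voted) if not v]
    return mean(voters) - mean(abstainers)


voted = [True, False] * 4  # four voters and four abstainers

# Homogeneous neighborhood: everyone holds the same preference, so the
# only difference between the groups is the cost of voting.
same = [0.3] * 8
gap_same = satisfaction_gap(same, voted, winning_platform=0.1, cost=0.1)

# Heterogeneous neighborhood: the voters happen to sit near the winner,
# the abstainers far from it, so voting looks attractive despite its cost.
mixed = [0.1, 2.0, 0.2, -1.5, 0.0, 1.8, 0.3, -2.2]
gap_mixed = satisfaction_gap(mixed, voted, winning_platform=0.1, cost=0.1)
```

In the homogeneous case the gap equals minus the cost of voting, so the advantage of free riding is plain to see. In the heterogeneous case the gap is driven mostly by who happens to sit near the winner, and here it even favors the voters.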


PARTY BEHAVIOR 


Turning to party behavior, we note that the model generates a substantial
degree of platform divergence (see fig. 1 above). The game-theoretic
literature suggests that uncertainty is a necessary condition for platform
divergence (Wittman 1977; Calvert 1985). These models introduce an
exogenous source of uncertainty, but in our model, uncertainty is generated
endogenously by variation in voter turnout. The location of the median
voter changes from election to election as new sets of voters show up to
the polls. Figure 6 compares results when we fix voter turnout to those
when we allow it to vary. When we fix turnout and the location of the
median voter is constant, the parties quickly infer its location and
converge. When we allow voters to choose whether or not to vote, the
platforms diverge. Clearly, parties behave differently when turnout behavior
is allowed to vary, suggesting that it may be important to model both
voters and parties simultaneously as we do here.

Fig. 5. The effect of preference correlation on turnout. Based on 1,000 simulations of
a population of 1,024 voters with independent preferences drawn from a standard normal
distribution, party preferences at -1 and 1, cost of voting of 0.1, and initial probability of
turnout of 0.5. Preference correlation was varied from 0 to 0.95. Means calculated using
full-bandwidth LOWESS.

The model also suggests that platform divergence may result from
parties choosing strategies that react positively to the margin of victory.
Figure 7 shows that both parties typically move their platforms in the
direction of the winning candidate and in proportion to the margin of
victory. For example, if the left wins a close election, both parties will
shift slightly to the left. If the left wins in a landslide, both parties will
shift a lot to the left. This large shift occurs because a landslide victory
causes the winning party to infer that it can win with a platform that is
closer to its own preferences. A landslide victory also causes the losing
party to learn that it must moderate in order to be competitive in the
next election. The relationship between platforms and vote share is
consistent with the literature on presidential mandates (Conley 2001; Kingdon
1966), a more detailed analysis of Wittman equilibrium (Smirnov and
Fowler 2003), and recent evidence that shows that past vote share affects
the ideology of U.S. Senate candidates (Fowler 2005).

The effect of these strategic interactions is that parties try to adapt to
the (unknown) positions of the median voter and the median citizen. In
figure 1, we showed that in a given run, the median voter changes
frequently while the median citizen remains constant. Parties have a
short-term incentive to exploit the former if there tends to be some persistence
in the set of voters who turn out from one election to the next. However,
they also have a long-term incentive to stay close to the median citizen,
who represents the pool of all possible voters in future elections. Figure
8 shows that platforms tend to track both changes in the location of the
median voter and the fixed location of the median citizen. Interestingly,
the parties are more sensitive to the location of the median citizen than
the median voter, which implies that parties pay more attention to the
long-term shape of the electorate than to short-term changes.

Fig. 6. Effect of fixed and variable turnout on party behavior. Example based on a p…
drawn from a standard normal distribution, party preferences at -1 and 1, and cost of votin…

Fig. 7. Effect of vote share on party behavior. Based on 1,000 simulations of a population
of 1,024 voters with independent preferences drawn from a standard normal distribution,
party preferences at -1 and 1, cost of voting of 0.1, and initial probability of turnout of
0.5. Solid line is the left party and dotted line is the right party. Means calculated using
full-bandwidth LOWESS.

Finally, we highlight a surprising interaction between parties and
voters. Figure 9 shows that increasing preference correlation among voters
dramatically increases platform divergence. This is because preference
correlation tends to increase variance in the vote share. Heterogeneous
neighborhoods will have one or two citizens switching their behavior when
the parties adjust slightly to the left or right, but homogeneous
neighborhoods will have several citizens switching together: small changes in
the location of the parties can quickly lead to waves of imitation among
supporters of one of the parties. Whole neighborhoods teeter on the brink
of voting or not, and the result is to increase swings in electoral outcomes.
This increases uncertainty about the location of the median voter and
has a corresponding effect on the parties. In short, self-segregation yields
party polarization.


SUMMARY 


The subject of elections, including turnout and platform dynamics, is
challenging for all social scientists. One of the main reasons for this
difficulty lies in the fact that various elements of the electoral process are
easier to study separately. We believe that an interdisciplinary approach,
built upon contributions from several social science disciplines, will lead
us to a better understanding of the subject. The agent-based model we
propose is built upon a number of important contributions by sociologists
(social context of voters), psychologists (bounded rationality and use of
heuristics), economists (platform dynamics and turnout decision),
anthropologists (cultural influence exemplified by imitation), and last, but not
least, political scientists (interdependence of voters and candidates,
dynamic nature of the electoral competition, empirical analysis of
observations).

[Figure 8: y-axis, Mean Left and Right Platforms; x-axis, Median Citizen Preference]
Fig. 8. The effect of median citizen and median voter preferences on party behavior. Ba…
with independent preferences drawn from a standard normal distribution, party preferences…
of turnout of 0.5. Solid line is the left party and dotted line is the right party. Means calcula…

Dynamic Parties and Social Turnout

[Figure 9: y-axis, Mean Left and Right Platforms; x-axis, Voter Preference Correlation]
Fig. 9. The effect of preference correlation on party behavior. Based on 1,000 simulations
of a population of 1,024 voters with correlated preferences drawn from a standard normal…
party and dotted line is the right party. Means calculated using full-bandwidth LOWESS.

Our model yields several findings consistent with the empirical
literature on parties and voters and suggests some relationships that have not
yet been tested (see table 1 for a summary). The central result is that
turnout is significant, platforms diverge, and they both vary over time in
an empirically realistic way. These phenomena emerge when we allow
both turnout and platform strategies to adapt to one another over time.
Making citizens boundedly rational and placing them in a social context
turns out to be important. A closer look at the model neighborhoods
shows that local imitation in a social network inherently yields negative
feedback dynamics that encourage turnout. The effect is further amplified
by the natural limits on the information-processing capacities of the
citizens, such as length of memory. On the other hand, local correlation of
preferences appears to decrease individual propensity to turn out, which
implies that ideologically homogeneous communities are least likely to vote.
The model also conforms to findings from the social voting literature.
Citizens appear to be affected by the turnout decisions of their neighbors.

TABLE 1
SUMMARY OF RESULTS

Result                                      Consistent With

Turnout is significant and stable           Mackie and Rose (1997)

Turnout depends negatively on voting costs  Rosenstone and Wolfinger (1978), Squire et al.
                                            (1987), Nagler (1991), Rhine (1995), Knack
                                            (1997, 2001), Franklin and Grier (1997),
                                            Fenster (1994), Highton (1997), Knack and
                                            White (2000), Highton (2000), Huang and
                                            Shields (2000), Oliver (1996), Karp and
                                            Banducci (2000), Southwell and Burchett (2000)

Party divergence increases turnout          Wolfinger and Rosenstone (1980), Boyd (1989),
                                            Hansen et al. (1987), Jackson (2000),
                                            Kaempfer and Lowenberg (1993)

Turnout increases with the closeness of     Cox and Munger (1989), Berch (1993), Jackson
  the election                              (1983), Hanks and Grofman (1998), Grofman
                                            et al. (1998), Nalebuff and Shachar (1999),
                                            Alvarez and Nagler (2000)

Longer voter memories reduce turnout        Original result

Decision to vote depends on turnout         Lazarsfeld et al. (1948), Berelson et al. (1954),
  behavior of socially connected peers      Campbell et al. (1954), Glaser (1959),
                                            Huckfeldt and Sprague (1995), Straits (1990),
                                            Knack (1992), Kenny (1992)

Local imitation yields positive feedback    Original result
  for turnout

Ideological segregation reduces turnout     Original result

Parties diverge                             Peltzman (1984), Grofman et al. (1990), Poole
                                            and Rosenthal (1984), Hansson and Stuart
                                            (1984), Lindbeck and Weibull (1993),
                                            Wittman (1977)

Parties respond to past margins of victory  Conley (2001), Kingdon (1966), Fowler (2002),
                                            Smirnov and Fowler (2003)

Parties respond both to median voter and    Original result
  median citizen

Ideological segregation yields polarized    Original result
  parties

Turning to parties, the model yields several empirical implications.
Allowing turnout to vary endogenously generates uncertainty about the
location of the median voter and causes party platforms to diverge. We
also note that parties pay attention to electoral mandates as they try to
estimate the location of the median voter to remain competitive. This
ongoing revision of platforms could help to explain empirical work that
shows that the ideology of U.S. Senate candidates and the expectations
of economic policy are sensitive to previous vote share (Fowler 2002, 2005).
The model also shows that party platforms tend to correlate with changes
in the position of both the median voter and the median citizen, with
parties being more sensitive to the latter. Finally, we find that a higher
degree of local preference correlation among voters leads to greater
platform divergence. Voter segregation yields party polarization.

In conclusion, though our model generates relationships that correspond
to much of what we know about turnout and platforms, it is important
not to read too much into the results. There are many factors that we
have not included here that may affect turnout and platforms, such as
socioeconomic status, endogenous voter and party preferences,
multidimensional issue space, multiple parties, multiple districts, different
electoral institutions, political institutions like legislatures, and so on.
Agent-based modeling makes it easy to add such factors quickly to see if and
how they are relevant, but we believe that initial modeling efforts for
problems like these should remain simple in order to provide a bridge to
what may already be an extensive analytical effort. Our hope is that this
approach will not only provide good predictive models of electoral politics,
but will also generate hypotheses that inspire future analytical efforts to
find related closed-form solutions and empirical efforts to test relationships
suggested by the model.


Sociology and Simulation: Statistical and
Qualitative Cross-Validation¹

Agent-based simulation modeling enables the construction of formal
models that simultaneously can be microvalidated against accounts
of individual behavior and macrovalidated against aggregate data
that show the characteristics of many socially derived time series.
These characteristics (leptokurtosis and clustered volatility) have
two important consequences: first, they also appear in suitably
structured agent-based models where, like real social actors, agents are
socially embedded and metastable; second, their presence precludes
the use of many standard statistical techniques like the chi-square
test. These characteristics in time-series data indicate that a suitable
agent-based model rather than a standard statistical model will be
appropriate. This is illustrated with an agent-based model of mutual
social influence on domestic water demand. The consequences for
many frequently used statistical techniques are discussed.


INTRODUCTION 


The core of this article is an agent-based simulation model that relates
some aspects of social influence between households to their aggregate
water demand patterns. The purpose of the model is to illustrate that
such models can be microvalidated against accounts of the actors’
behavior, and that they can also produce aggregate outcomes consistent with
what is observed. This model is not intended as a definitive demonstration
of how such households actually behave and interact, for the model
requires considerably more micro- and macrovalidation before reliable
conclusions could be drawn. However, it does establish the possibility of doing
so and is the first step in that direction. Further, it shows how accounts
of behavior that were previously difficult to include in formal sociological
models can be incorporated in an agent-based model, allowing their
consistency with the aggregate data to be assessed. Thus agent-based
modeling also increases the types of observation that can be modeled and
hence made more open to sociological scrutiny.

The article proceeds as follows. First, we position and contrast the role
of simulation models with respect to both social theory and econometric
modeling to emphasize how and in what ways this approach differs. We
then look at the signs of volatility in social data (including the data used
here) and consider its possible causes in terms of the relationships among
social actors. The consequences for existing statistical models are then
illustrated by means of an examination of some examples that utilize
statistical assumptions. This provides a key motivation for the agent-based
example that follows. The differences between the two are brought into
focus in terms of the contrast between descriptive and generative models;
in particular, the possible masking of the signs of volatility in data as a
result of its aggregation over time is demonstrated. We conclude with a
summary of the consequences for the practice of formal modeling in
sociology.


SOCIAL THEORY AND SIMULATION 


The relationship between social processes and institutions on the one hand
and social statistics on the other has been an important and controversial
issue in sociology at least since the publication at the turn of the 20th
century of Max Weber’s The Protestant Ethic and the Spirit of Capitalism
(Weber 1958). In this article, we are concerned with that relationship. We
argue that, on the basis of the evidence of social inquiry, analytic models
do not obviously explain important properties of social statistics. However,
a class of simulation models does generate numerical outputs that are
consistent with important properties of real social statistics. These models
have two additional properties that should be of consuming interest to
sociologists. One is that they appear to produce data with empirically
relevant properties because they capture features of social order that are
the subject of sociological inquiry: the social embeddedness of individuals
together with the emergence of social norms. The other is that these models
naturally draw upon and cohere with the sort of detailed, qualitative
studies of social processes found in core strands of the sociological
literature.

Our central argument can be seen as an operationalization of some
elements of structuration theory (Giddens 1984). According to Blaikie
(1993), Giddens proposed that social research can take place at four related
levels: (1) hermeneutic elucidation of frames of meaning, (2) investigation
of context and form of practical consciousness, (3) identification of bounds
of knowledgeability, and (4) specification of institutional orders. Of these
four levels, says Blaikie, the first two are “micro” and best investigated
qualitatively, while the second two are “macro” and best investigated with
quantitative methods. This view is very close to ours. The microbehavior
is the behavior of observed actors and is described by autonomous
software modules called agents. The macrobehavior is the behavior of a social
institution (organization, community, set of customers, etc.) or a collection
of such institutions and is described by the properties of the model
containing the agents. The properties of the model as a whole are amenable
to summary using descriptive statistics, while the behavior of the
individual agents can (and we argue should) be described qualitatively.

We neither seek nor claim an exact parallel with structuration theory.
Agents are not replications of persons. They are simplified, formal
representations. Their simplicity and formality reduce the ambiguity of any
analysis of their behavior and social interaction at the cost of losing
expressiveness relative to qualitative studies of observed actors.
Consequently, there seems little to be gained by an in-depth study of a
hermeneutic circle of a social simulation model. As will become evident below,
there is more to be gained from an analysis of the social context of
individual agents, that is, the other agents with which they interact and the
patterns and extent of the influence of specific agents upon one another. At
the macrolevel, our concern is with the identification of “statistical
signatures” that can be explained by appeals to qualitative microlevel
behavior and interaction. When such microlevel phenomena can be
demonstrated to describe aspects of observed social behavior and interaction,
and, at the same time, to generate the macrolevel phenomena sharing the
statistical signature of real social data, then we shall say that the model
has been cross-validated. That is, the microlevel behavior has been
validated qualitatively by domain experts, and the macrolevel data have
been validated by comparing statistical properties of the numerical
outputs from the models with real social statistics. The link between the two
is provided by simulations with the model. The chains of causation
identified in simulation runs demonstrate a possible explanation of the link.

Our reliance on simulation models rather than on closed analytic or
statistical models, together with our focus on empirical validation in
relation to both qualitative and statistical data, implies a methodology that
relies on a close interaction between observation and conceptual
development to the exclusion of prior theoretical specification. Our argument
and methodology are atheoretical. We offer no view on the prospects for
a general social theory, but we do insist on the importance of observation-
based conceptual development as a precursor to any future social theory.

We begin below with an account of a widespread social statistical
signature that is shown in the following section to be consistent with the
sociological phenomenon of social embeddedness. In the final two sections
we report the specification of a model of domestic water demand and
then develop it to support an example of cross-validation. This model
incorporates agent designs based on descriptions of individual behavior
provided by domain experts from the U.K. water supply industry and
regulatory agencies.


THE CONTRAST WITH ECONOMETRIC MODELING 


Statistics are used by social scientists not only to inform descriptions of
existing social phenomena but also to forecast future outcomes. In practice,
there are many events that cannot be forecast. Indeed, there has never,
in the history of statistical analysis, been a correct statistically based
forecast of a turning point in either macroeconomic trade cycles or
financial market prices and volumes.² Moreover, it has been known for
more than a quarter century that statistical relations obtained from
regression analyses on data covering one time period typically support
different explanations of social relationships than are indicated by data
covering a later (postpublication) time period (Mayer 1975).

In this section, we concentrate on a general feature of social statistics
of which the unpredictability of turning points and changing statistical
relations are special cases. We then demonstrate that the problem is far
more general than is normally recognized. Since most social science models
with this purpose are economic in nature and predictive in (avowed)
purpose, we will start by examining their success. Since they are predictive
rather than explanatory in purpose, it is on predictive success they are
judged. Our purpose is more fundamental, namely to examine the
usefulness of these kinds of models in capturing social phenomena and the
statistics derived from them. We suggest that suitable agent-based models
will not be better at prediction but, unlike economic models, can be used
to help explain and understand those phenomena.

2 This statement has not been shown to be wrong in a wide search of the econometric
and economic literature. Neither has it been disproved upon circulation with a request
for disconfirmation on the e-mail discussion list of the International Institute of
Forecasters, nor upon publication in a leading journal (Moss 2002), or in discussions with
a wide range of econometricians and social statisticians.

Both forecasting failure and unpredictable changes in estimated
statistical relationships can be consequences of clusters of volatility in
the data where neither the timing nor the magnitude nor the duration of the
volatility can be predicted. Some obvious cases are turning points in
macroeconomic time series associated with upturns and downturns in
economic performance (investment, employment, consumption, etc.) and
turning points in stock market prices. These turning points are marked
by much bigger changes in the values of statistical variables than are
observed between turning points; indeed, such marked turning points
are instances of clustered volatility.

The usual explanations of unpredictable macroeconomic turning points
are either the occurrence of some kind of structural change (Clements and
Hendry 1995, 1996) or the argument that volatility begets volatility (for
a while; Bollerslev 1986; Engle 1982; Hansen 1982), so that the parameters
of the statistical distribution from which observations are said to be drawn
themselves vary over time. The latter set of techniques constitutes the
approach of time-varying parameters (TVP). None of the estimating
approaches associated with either of these explanations has yielded a
correct forecast of a turning point or any other episode of volatility. The
TVP approach has also been applied to financial data with no more success
than was obtained with macroeconomic data (Bollerslev 2001).

The only test of the soundness of these techniques is their ability to
capture salient aspects of the data series to which they are applied. Like
all other forecasting techniques, they fail to forecast future clusters of
volatility. The TVP approach does capture some aspects of previously
observed data series but only because those aspects are already implicit
in the estimating techniques (Tay and Wallis 2000).


THE DATA AND THEIR CHARACTERISTICS 


The wide recognition of the recalcitrance of macroeconomic and financial
market turning points to forecasting has not been extended to the
time-series data from other social and economic processes. Yet the more
general observation of unpredictable clusters of volatility appears to be
far more widespread. As Moss (2002) has pointed out in a slightly different
context, unpredictably clustered volatility characterizes the sales values
and volumes of many fast-moving consumer goods sold in UK and US
supermarkets and, as we now demonstrate, domestic water consumption.
Figure 1 illustrates the aggregate water demand from a small area in
the United Kingdom. One can see that there are occasional spikes, both
upward and downward, in domestic water consumption that might be
associated with weather conditions but that do not occur at the same
times of the year. While the quality of the data in general is as good as
we can expect, there is a long period from October 1995 into November
1997 when the readings were constant. This is clearly a problem with the
data collection, and the data for that period have been excluded from all
subsequent analysis. Figure 2 shows the graph of proportional changes
in the daily consumption of water.

FIG. 1.—Daily water consumption (liters per day) in a metered UK neighborhood

FIG. 2.—Relative changes in daily water consumption

The histogram in figure 3 gives the frequency distribution of the changes
in the daily values of water consumption, and the continuous ogive is the
normal distribution for the same mean, standard deviation, and sample
size. A standard test for normality, the Kolmogorov-Smirnov statistic,
gives, to three significant figures, zero confidence that the observed
distribution in figure 3 is normal. The higher, thinner peak of the actual
frequency distribution with respect to the corresponding normal
distribution is called leptokurtosis (i.e., thin peaked) and is due to the
presence of significant numbers of values relatively far from the mean.
These distant values are manifestations of volatile episodes. The district
for which these data were collected is small and is thus effectively
homogeneous with respect to weather. The volatility is not substantially
caused by changes in the seasons, as it remains if these are factored out.
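The diagnostics described above can be sketched in a few lines. The following is a minimal illustration using only the Python standard library, not the procedure actually used for figure 3; the function names and the sample series are hypothetical. It computes relative changes, the excess kurtosis (positive for a leptokurtic sample), and the Kolmogorov-Smirnov distance between the empirical distribution and a normal distribution with the same mean and standard deviation.

```python
from statistics import NormalDist, fmean, pstdev


def relative_changes(series):
    """Proportional change between consecutive observations."""
    return [(b - a) / a for a, b in zip(series, series[1:]) if a != 0]


def excess_kurtosis(xs):
    """Fourth standardized moment minus 3 (0 for a normal distribution)."""
    m, s = fmean(xs), pstdev(xs)
    return fmean([((x - m) / s) ** 4 for x in xs]) - 3.0


def ks_statistic_vs_normal(xs):
    """Largest gap between the empirical CDF and the fitted normal CDF."""
    fitted = NormalDist(fmean(xs), pstdev(xs))
    ordered = sorted(xs)
    n = len(ordered)
    gap = 0.0
    for i, x in enumerate(ordered):
        cdf = fitted.cdf(x)
        gap = max(gap, abs(cdf - i / n), abs((i + 1) / n - cdf))
    return gap


# Hypothetical daily readings (liters per day), for illustration only.
readings = [400.0, 410.0, 405.0, 640.0, 395.0, 402.0, 398.0, 401.0]
changes = relative_changes(readings)
print(excess_kurtosis(changes), ks_statistic_vs_normal(changes))
```

Converting the distance into a significance level requires the Kolmogorov distribution (with a correction when the mean and standard deviation are estimated from the same sample), which this sketch leaves out.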

Similar results are found in sales value and volume data for a wide
range of fast-moving consumer goods. An example, reproduced from Moss
(2002), is U.K. supermarket sales data for three brands of shampoo.
Similar results are found for virtually every one of the 120 or so brands
of shampoo for which we have the data, as well as every brand of tea,
shaving preparations, biscuits, and, in the United States as well as the
United Kingdom, every one of some 200 brands of spirituous alcoholic
beverage and beer. The first row of figure 4 shows weekly sales values.
Brand A is a leading brand with no discernible sales trend, while sales
values of brand B are declining, and sales values of brand C are increasing.
Both of the latter have small market shares. The second row shows the
time series of relative sales changes. Over the 65 weeks there were obvious
clusters of volatility, and it is these clusters that generated the extreme
values that cause the leptokurtosis evident in the third row, showing the
frequency histograms of the relative sales changes compared with the
corresponding normal distribution.

FIG. 3.—Frequency histogram and normal ogive for relative changes in daily
water consumption data

Each data set manifesting leptokurtosis and clustered, large changes in
variable values can be ascribed a separate and special reason. In markets
for fast-moving consumer goods, the phenomenon might be due to special
offers (though we have found no evidence that this is so). In water
consumption data, the clustered changes could be due to the weather (though
seasonally adjusting the data does not affect the outcome). In the financial
markets, there are speculative bursts (though it seems to us implausible
that widespread shifts in expectations might be the result of strictly
individual sentiment). In national and international economic systems (or
macroeconomies), episodes of large clustered changes are often ascribed
to some kind of structural break or exogenous force (though if these were
uniquely identifiable, the occasions on which their consequences
amounted to a turning point in the trade cycle ought also to be uniquely
identifiable and hence correctly forecast). Rather than looking only for
special reasons to account for unpredictable and clustered volatility in
each type of social institution, we find it natural to investigate whether
there is any element of generality in the generation of these unpredictable
phenomena.

FIG. 4.—Weekly shampoo sales and relative sales change, January 2,
2000-March 25, 2001. The top row shows sales value against time, the middle
row the relative sales change over time, and the bottom row the
distribution of relative sales changes compared to a normal curve with the
same mean and SD. (Source: Information Resources International, reproduced
from Moss [2002].)


VOLATILITY AND ITS POSSIBLE CAUSES


Following Granovetter (1985), we define social embeddedness as a state
in which an actor is significantly influenced by individual relationships
with other actors. That is, the influences on the actor cannot be modeled
as if all actors were identical (by using, e.g., representative or
homogeneous agents). The particularities of its interactions with those to
which it relates have a significant impact upon the outcomes in the model.
While social embeddedness neither implies nor denies optimizing behavior of
the sort assumed by economists, it does mean that the simplifying
assumptions used by economists in their formal-analytic rational actor
models critically distort the phenomena being analyzed.

There appear to be no models incorporating agents as defined in
conventional economic theory based on utility-maximizing rational actors
that generate the unpredictable volatile clusters described in the preceding
section. Indeed, results from a wide range of agent-based social simulation
models suggest that four conditions are associated with clustered volatility
at the macrolevel. The four conditions are as follows:³


1. Individuals are metastable in the sense that they do not change their
   behavior until some level of stimulus has been reached. They would
   not, for example, reconfigure their desired shopping basket as a result
   of a penny's rise in the price of a tin of tuna or change their religion
   because they dislike a sermon. A particular implication of
   metastability is that the behavior of individuals cannot be represented
   by utility-maximizing agents.

2. Interaction among agents is a dominant feature of the model
   dynamics. This amounts to social embeddedness in the sense of
   Granovetter (1985): the behavior of individuals cannot be explained
   except in terms of their individual interactions with other individuals
   known to them.

3. Agents influence but do not slavishly imitate one another.

4. The system is slowly driven so that most agents are below their
   threshold (or critical) states much of the time. In effect, most of us
   engage in routine behavior most of the time without fundamentally
   changing either our behavior or the expectations that drive that
   behavior. The system is slowly driven if we are not frequently
   overwhelmed by pressures to change expectations and behavior.
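To make the four conditions concrete, the following is a deliberately abstract toy sketch in Python. It is not the water demand model reported later in this article, and every name and parameter value in it is a hypothetical choice made for illustration. Agents sit on a ring; each holds a stimulus level and changes behavior only when that level reaches a threshold (condition 1); a change passes part of the stimulus to the two neighbors (conditions 2 and 3: neighbors are nudged, not copied); and only one small nudge enters the system per time step (condition 4).

```python
import random


def run_threshold_model(n_agents=50, steps=2000, threshold=1.0,
                        drive=0.05, share=0.45, seed=0):
    """Return the number of behavior changes in each time step."""
    rng = random.Random(seed)
    stimulus = [rng.uniform(0.0, threshold) for _ in range(n_agents)]
    changes_per_step = []
    for _ in range(steps):
        # Slow driving: one randomly chosen agent receives a small nudge.
        stimulus[rng.randrange(n_agents)] += drive
        changed = 0
        active = [i for i, s in enumerate(stimulus) if s >= threshold]
        while active:
            i = active.pop()
            if stimulus[i] < threshold:
                continue  # already relaxed earlier in this step
            released = stimulus[i]
            stimulus[i] = 0.0  # the agent changes behavior and resets
            changed += 1
            # Influence, not imitation: each neighbor receives a share of
            # the released stimulus and changes only if this pushes it past
            # its own threshold. 2 * share < 1, so every cascade dies out.
            for j in ((i - 1) % n_agents, (i + 1) % n_agents):
                stimulus[j] += share * released
                if stimulus[j] >= threshold:
                    active.append(j)
        changes_per_step.append(changed)
    return changes_per_step


series = run_threshold_model()
print(max(series), sum(1 for c in series if c == 0))
```

Whether a given parameter setting yields clustered volatility is an empirical question about the output series, to be checked with the same diagnostics applied to real data rather than assumed.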


An agent-based social simulation model contains agents that are
independent computer programs with an ability to perceive aspects of their
environments (including some other agents) and to process those
perceptions in order to produce some effect on their environment (including
other agents) and perhaps to change the ways in which they process
perceptions into actions. As will be explained in more detail below, an
important feature of agents is that they and their interactions can be
designed by the modeler to describe the behavior and interactions of social
entities, whether these are individuals or organizational units.
Consequently, agent-based social simulation models can be validated by
comparing the agents and their social behavior with individual and social
behavior found in real societies, and by comparing the properties of
numerical outputs of these models with the properties of real social
statistics. Edmonds (1999a) provides a clear example of an agent-based model
where the agents are demonstrably socially embedded.


³ These are social interpretations of a more general set of conditions
identified by Jensen (1998) in respect to physical models.

Unfortunately, the meanings of such phrases as "statistical signature"
and "the properties of real social statistics" are by no means clear and
unproblematic. In some cases, the variance or even the mean of a
distribution is not defined. There is not even any reason to believe that
observed social statistics are drawn from some fixed, underlying
population distribution, nor any reason to believe that (in some cases) they
are not (these issues will be investigated below). In the meantime, we note
that both societies and models can be viewed as data-generating processes.
Many social arrangements, including national economies, retail shops and
supermarkets, and collections of households, generate time-series data
marked by clusters of volatility the timing, magnitude, and duration of
which cannot in practice be forecast. Models with the above four
characteristics also generate time-series data with the same clustered
volatility properties.

In the following two sections, we describe a model and the background
to its development in order to exhibit the extent to and the ways in which
that model is validated by domain experts. We also report simulated
time-series data generated by the model under various assumptions in order
to suggest that the simulated data, like the real data, exhibit
unpredictable clusters of volatility. We also note the incompleteness of
the validation as well as problems with the identification of the
statistical signatures of leptokurtosis due to clustered volatility.


THE IMPORT FOR STATISTICS AND SOCIOLOGY 


In this section we identify clearly what agent-based modeling adds to the
range of statistics-using articles often published in this journal. We
identified 11 such articles published in the American Journal of Sociology
from March 2003 to January 2004. We then chose two of those articles,
Huffman and Cohen (2004) and Harding (2003), which seemed to us to
represent good examples of careful reasoning with clear discussions of the
limitations of their statistical approaches. The purpose of this section is
to describe how agent-based modeling might reasonably have influenced
even these careful discussions. We also discuss a third, somewhat older
article (Healy 2000), which would have benefited from a complementary
analysis using agent-based modeling for different reasons.

Harding presents an account of the effects of selection bias on statistical
estimates of the effect of neighborhood poverty on school dropout rates
and the incidence of teenage pregnancy. His critique of the effects of
selection bias in previous studies of neighborhood poverty is matched by
his exposition of the limitations of the technique he is proposing: the
counterfactual causal framework employing propensity score matching
and sensitivity analysis. This is a nonparametric approach, and so
problems associated with the population distribution (if there is one)
inferred from a leptokurtic sample distribution do not arise. However, in
one of the steps of his estimation procedure, Harding uses a chi-square
statistic to correct for unwanted clustering in his data (p. 689). This is
by no means crucial to his argument, but it does represent a standard use
of statistical analysis in all applied social sciences including sociology.

The chi-square distribution can only be used to distinguish two samples
drawn from normal population distributions where either the sample
standard deviations are the same and the test is for differences in means,
or the sample means are the same and the test is for differences in the
standard deviations. The discussion in the previous section indicates that
strong social interaction and influence together with metastable individual
behavior lead to macrolevel statistical distributions that cannot in general
be drawn from a normal population distribution. In these circumstances,
no test or regression analysis based on a normal distribution satisfies its
conditions of application.

Now Harding refers to neighborhood effects in terms of poverty,
unemployment, racial composition, and other numerical and
neighborhood-level indicators. The role of social interaction is not
considered. There is nonetheless the difficulty that the statistical
analysis might be inappropriate because of the effects of social
interaction. One means of assessing the extent of those effects would be to
produce an agent-based model of the phenomena of interest and to explore
the conditions in which it was possible to simulate the social process
leading to dropping out of school or unsafe sexual encounters in such a way
that the conditions are met for the application of a chi-square or any other
statistical test. If, for example, the relative changes in the simulated
school dropout rate turned out to be distributed leptokurtically, then no
chi-square test would be applicable to the simulated data. Such simulation
results constitute a warning flag to be investigated by looking for evidence
of leptokurtosis and clustered changes in dropouts or pregnancies in the
actual data. We know from the central limit theorem that aggregating
virtually any data series sufficiently over time will hide leptokurtosis and
cause the distribution of such coarse-grain (i.e., aggregated over time)
data to approximate the normal. Consequently, agent-based simulations that
generate clustered volatility and leptokurtosis at fine grain should
indicate the need to investigate whether there is clustered volatility and
leptokurtosis in the actual data at the same grain.
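The masking effect of aggregation can be illustrated with a small Python sketch. It rests on assumptions that are ours and are made only for illustration: the fine-grain changes are independent draws from a mixture of a calm and a volatile normal distribution (so the variance is finite and there is no clustering in time), and all function names and parameter values are hypothetical. The point is only that excess kurtosis visible at fine grain shrinks toward zero once the series is summed over long blocks.

```python
import random
from statistics import fmean, pstdev


def excess_kurtosis(xs):
    """Fourth standardized moment minus 3 (0 for a normal distribution)."""
    m, s = fmean(xs), pstdev(xs)
    return fmean([((x - m) / s) ** 4 for x in xs]) - 3.0


def aggregate(xs, block):
    """Sum consecutive non-overlapping blocks (e.g., days into months)."""
    usable = len(xs) - len(xs) % block
    return [sum(xs[i:i + block]) for i in range(0, usable, block)]


def heavy_tailed_changes(n, seed=0, p_volatile=0.1,
                         calm_sd=1.0, volatile_sd=5.0):
    """Independent draws: mostly calm, occasionally volatile (assumed)."""
    rng = random.Random(seed)
    return [rng.gauss(0.0, volatile_sd if rng.random() < p_volatile
                      else calm_sd)
            for _ in range(n)]


fine = heavy_tailed_changes(20000)
coarse = aggregate(fine, 50)
print(excess_kurtosis(fine), excess_kurtosis(coarse))
```

A fine-grain excess kurtosis well above zero alongside a coarse-grain value near zero is the pattern the paragraph above warns about: the coarse series looks unremarkable even though the fine series is strongly leptokurtic.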

The second article, by Huffman and Cohen (2004), tests whether job
segregation and job devaluation increase with the size of the local black
population and increase wage inequality. As in the Harding article, there
is no explicit analysis of the effects of social embeddedness or such
characteristics of individual behavior as metastability. Nor is there any
test for leptokurtosis. If leptokurtosis in the sample data were found,
then the regression analyses performed by Huffman and Cohen would be otiose,
since the conditions of application of such analysis would not be met
(Fama 1963). In that case, the only models that might capture the
phenomenon being analyzed would be agent based, with the agents being
designed to describe the behavior of the individuals whose actions
collectively generate the observed statistics. Note that we are not claiming
that the Huffman-Cohen analysis is wrong. We are claiming that, in light
of results from the agent-based social simulation literature, the analysis
could now be made more complete.

The third article considered here (Healy 2000) differs from the other
two in that it explicitly takes into account microlevel behavior and social
interaction. Healy specifies 10 hypotheses based on qualitative analyses
in the sociology literature regarding mainly cultural and institutional
issues. These hypotheses were tested by means of logistic regressions on
extensive survey data reported in a white paper from the Council of
Europe.

There is another, and for our purposes much more important, hypothesis
drawn by Healy from the literature:


Piliavin and Callero's (1991) work suggests that people are more likely to
donate blood if they know other donors, or if they know people who have
received transfusions (or other blood products). Similarly, Drake,
Finkelstein, and Sapolsky (1982, pp. 81-83) report that those who are
"close to blood needs" will be more likely to donate. We should expect
typical network effects here: if all your friends are blood donors, you are
likely to be one too. If you know a hemophiliac, you should also be more
likely to have given blood at some point.

HYPOTHESIS 4.—The odds of donating blood increase if you know
anyone (including oneself) who has received a blood transfusion. (Healy
2000, p. 1636)

The literature motivating this last hypothesis suggests that blood donors
are socially embedded in the sense that they are influenced by their social
context and the behavior and circumstances of their acquaintances. If
individuals have thresholds above which their behavior changes (for
example, some number of acquaintances who have needed blood transfusions,
where that number could be conditioned by the closeness of the
acquaintanceship or similarity of circumstances) and they are not simply
swamped by stimuli, then we would look for clusters of volatility in
time-series data and power-law distributed cross-sectional data. If there is
any defined population distribution from which the data can be treated as a
sample, then its variance, and, as likely as not, its mean, are not defined.
This means that the law of large numbers does not hold, and that time
series generated by such a social process will be heteroscedastic. No
parametric regression analysis is applicable in such conditions.

The only test of which we are aware that would inform a determination
of whether qualitative, institutional data can in principle be used to
formulate regression models turns on agent-based social simulation. As we
have pointed out, social simulation models in which agents influence but
do not generally imitate one another, and in which agents respond to some
significant stimuli but do not optimize over any continuous function,
frequently produce results that are characterized by clusters of volatility
in time-series data and power-law distributed cross-sectional data.
Moreover, even with full knowledge of the model (and hence of the
data-generating mechanism), neither the timing nor the frequency nor the
magnitude of the volatile clusters can be forecast. Neither can the values
in the immediate aftermath of the volatility be forecast.

The description of these articles will suffice to make the point clear.
Agent-based social simulation shows that there is some consistency
between individual behavior and social interaction on the one hand, and
social statistics on the other hand. Econometricians have recognized this
implicitly, and social simulation models frequently show this relationship
explicitly. If sample data are sufficiently fine grained to show
leptokurtosis and clustered volatility, then that statistical signature
calls for an investigation into whether metastable individual behavior and
strong social embeddedness are important features of the relevant social
processes. Further confirmation or disconfirmation of that proposition is
naturally sought by implementing agent-based models validated against
observed or perceived behavior. The simulation designs and outputs can lead
to the same sorts of assessments of social phenomena that were sought in
the three articles considered in this section. Conversely, if social
simulation models indicate leptokurtosis and clustered volatility that is
not found in correspondingly fine-grain data, there is no empirical support
for the social simulation model, and there is no contraindication to the
use of aggregate (in the Harding article, neighborhood-level) data.


AN EXAMPLE: A MODEL OF SOCIAL INFLUENCE ON DOMESTIC
WATER CONSUMPTION


In this section we illustrate how an agent-based simulation can
simultaneously be consistent with accounts of the behavior of households and
produce aggregate time series similar to those observed. Thus the model
reported in this section was not designed and implemented to provide a
complete sociological analysis of domestic water consumption. It was
produced as part of the CC DEW (Climate Change Demand for Water)
project commissioned by the UK Environment Agency and involving
all UK government agencies involved in the regulation of the water
supply industry as well as the water supply companies. The purpose of
the project was to provide projections of the effects of climate change on
industrial, agricultural, and domestic water demand. Previous projections
had been based on econometric models. The innovative feature of CC
DEW was the use of agent-based models to support the development of
scenarios of water demand contingent on different paths of political
development and different patterns of climate change. Both in the CC DEW
project and a recent European project called FIRMA,⁴ agent-based social
simulation models were integrated with physical models so that the whole
model system formally captured both physical and social processes.

The model is relevant to the present argument because it was designed
to facilitate the capture of qualitative judgments, perceptions, and
information by stakeholders. In particular, we developed the model reported
here to capture the common perceptions and judgments of the
representatives (the water industry and its regulators) regarding the
determinants of domestic water consumption both during and between droughts
in the United Kingdom. That a complete analysis of domestic water
demand in the United Kingdom would also entail the elicitation of the
householders' accounts of their own determinants of their water
consumption will be obvious to any social scientist. That is not the purpose
of this discussion. Our concern in this section is only to provide an
example of how an agent-based model was used to condition statistical
analysis of the outcomes from a social process. The key point is that the
model generated leptokurtic time series with clustered volatility. This was
compared with the relevant real, social statistical data on domestic water
consumption. The comparison and the grain of the available data enabled
us to argue with greater confidence than would have been possible with
only more coarse-grain data that parametric statistical analysis is not
appropriate to the social processes described by the stakeholders.


⁴ See cfpm.org/firma

The first version of the model was constructed with little input of
domain expertise in order to capture the effect of enjoinders on households
to conserve water during periods of drought. This version demonstrated
that social influence was sufficient significantly to reduce domestic water
consumption when a minority of households was easily influenced by
official requests, provided that some households were so influenced and
other households were influenced by other households with whom they
had some stable relationship and who were, in a well-defined sense, similar
to themselves. Domain experts from the water supply companies evaluated
the first version, which they found deficient in that, when a drought ended,
aggregate water consumption immediately returned to its predrought
levels. The second version was designed to address this deficiency. Whereas,
in the first version, households would observe and be influenced by the
actions of their neighbors in the prevailing conditions, in the second
version it was only the neighbors' actions that influenced households, and,
moreover, the influence was assumed to decay over time in accordance
with evidence from experimental cognitive science (Anderson 1993).⁵

The CC DEW model was implemented in SDML (Moss et al. 1996), a
strictly declarative language in which every agent is implemented as a
set of rules and databases. The databases contain logical clauses, and the
rules place clauses on the databases once those clauses are proved to be
true in the existing conditions.

The core element of the representation of household behavior is the
endorsements mechanism (Cohen 1985). Each agent has a set of rules that
determine endorsements to be attached to their own representations of
other agents. For example, another agent might be deemed to be reliable,
trustworthy, kind, generous, or to have other such attributes. In the model,
there would be rules specifying the conditions in which tokens such as
reliable, trustworthy, kind, and generous would be applied to these other
agents.

The model incorporated both a hydrological model to determine water
content of the soil on the basis of historical temperature and precipitation
data, and a social model to capture the effects of that water content on
policy agencies and their pronouncements, as well as the effects of those
pronouncements on household consumption. These component models are
described in turn.


⁵ For a more detailed account of this development, see Downing, Moss, and
Pahl-Wostl (2000).


The Hydrological Model


The purpose of the hydrological model is to simulate the occurrence of
drought conditions on the basis of real precipitation and temperature data.
The policy agent determines there to be a drought when soil holds less
than 85% of its water retention capacity for two or more consecutive
months.

The soil water content for any month is determined by adding to the
previous month's content the total precipitation during the month in
question less the evapotranspiration. If the result is less than the water
retention capacity of the soil, then that is the current soil water content.
If the result is greater than the capacity, the difference is runoff.
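The monthly water balance and the drought rule can be sketched as follows. This is an illustrative Python rendering, not the SDML implementation; the function names are hypothetical, and flooring soil water at zero is our assumption, since the text does not say what happens when evapotranspiration exceeds the available water.

```python
def step_soil_water(previous, precipitation, evapotranspiration, capacity):
    """Return (soil_water, runoff) for one month of the water balance."""
    candidate = previous + precipitation - evapotranspiration
    if candidate > capacity:
        # Soil is full; the excess leaves the system as runoff.
        return capacity, candidate - capacity
    # Assumption (not stated in the text): soil water cannot go negative.
    return max(candidate, 0.0), 0.0


def drought_flags(soil_water_by_month, capacity, fraction=0.85, min_months=2):
    """Flag months in which soil water has been below `fraction` of
    capacity for at least `min_months` consecutive months."""
    flags = []
    run = 0
    for water in soil_water_by_month:
        run = run + 1 if water < fraction * capacity else 0
        flags.append(run >= min_months)
    return flags


print(step_soil_water(90, 30, 10, 100))             # full soil, 10 units of runoff
print(drought_flags([90, 80, 80, 80, 90], 100))
```

Under this reading, a drought is first declared in the second consecutive month below the 85% level and ends in the first month at or above it.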

Potential evapotranspiration (PET) in the model is determined by
temperature, mean day length during the month (calculated for the relevant
latitude for each day relative to the winter solstice), and a monthly
correction factor (Jan.-March, 0.8; April, 0.9; May, 1; June-Aug., 1.1;
Sept., 1.05; Oct., 0.85; Nov.-Dec., 0.8). The uncorrected value of PET is
calculated from the modified Thornthwaite algorithm from temperature and
hours of daylight per day. The value of the unadjusted PET is calculated
as

$$
\mathit{uPET} =
\begin{cases}
-415.8547 + 32.2441\,T - 0.4325\,T^{2}, & 26.5 \le T, \\
16\,(10T/H)^{a}, & 0 < T < 26.5, \\
0, & T \le 0,
\end{cases}
$$

where T represents the temperature range, and H is heat, defined as

$$
H = \sum_{n=1}^{12} \left( \frac{T_n}{5} \right)^{1.514},
$$

and the exponent a is

$$
a = 6.75 \times 10^{-7} H^{3} - 7.71 \times 10^{-5} H^{2} + 0.01792\,H + 0.49239.
$$

The resulting unadjusted value of PET is adjusted to allow for day length
so that

$$
\mathit{aPET} = \mathit{uPET} \times (\mathit{days}/30) \times (\mathit{mdl}/12),
$$

where uPET is the unadjusted value of PET determined as above, days
is the number of days in the month, and mdl is the mean day length in
the month.⁶ The physical model used was conditioned to the availability
of only monthly data, which required us to implement the model with
the month as the time step for data generation.
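The PET calculation can be written out as a short Python sketch. Several points are assumptions rather than statements of the model as implemented: the middle case uses the standard Thornthwaite form 16(10T/H)^a, months at or below 0 degrees are taken to contribute nothing to the heat index H, and the monthly correction factor is left out because the text does not show where it enters the formula. All function names are hypothetical.

```python
def heat_index(monthly_temps):
    """H: sum over the 12 monthly mean temperatures of (T/5)**1.514.
    Assumption: months with T <= 0 contribute nothing."""
    return sum((t / 5.0) ** 1.514 for t in monthly_temps if t > 0)


def exponent_a(h):
    """Exponent a as a cubic polynomial in the heat index H."""
    return 6.75e-7 * h ** 3 - 7.71e-5 * h ** 2 + 0.01792 * h + 0.49239


def unadjusted_pet(temp, h):
    """Unadjusted PET for one month with mean temperature `temp`."""
    if temp <= 0:
        return 0.0
    if temp >= 26.5:
        return -415.8547 + 32.2441 * temp - 0.4325 * temp ** 2
    # Assumed standard Thornthwaite form for the middle case.
    return 16.0 * (10.0 * temp / h) ** exponent_a(h)


def adjusted_pet(upet, days, mean_day_length):
    """Scale unadjusted PET by month length and mean day length (hours)."""
    return upet * (days / 30.0) * (mean_day_length / 12.0)


# Hypothetical monthly mean temperatures (degrees Celsius), Jan to Dec.
temps = [4, 4, 6, 8, 12, 15, 17, 17, 14, 11, 7, 5]
h = heat_index(temps)
print(adjusted_pet(unadjusted_pet(temps[6], h), days=31, mean_day_length=16.0))
```

A 30-day month with a 12-hour mean day length leaves the unadjusted value unchanged, which is a quick check on the scaling.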


⁶ This model was provided by Tom Downing of the Stockholm Environment
Institute, Oxford.


The Social Model


Because nobody knows everybody but everybody knows somebody, the
social model was constructed to capture the fact that every individual
has a limited but usually nonempty set of acquaintances. This was
achieved by situating agents on a toroidal grid and enabling each agent
to know every agent within a set number of cells in each of the four
cardinal directions. In all of the simulation runs reported here, the grid
size was 16² (= 256) cells, there were either 80 or 100 agents distributed
at random on the grid with no more than one agent to a cell, and each
agent had a horizon of four cells in the sense that it knew all agents in
the four cells to the right, left, up, and down.
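The acquaintance structure can be sketched in Python as follows. This is an illustration of the grid layout described above rather than the SDML code, and the function names are hypothetical. Positions wrap around at the grid edges, which is what makes the grid toroidal, and an agent sees only along its own row and column.

```python
import random


def place_agents(n_agents, size=16, seed=0):
    """Assign each agent a distinct random cell on a size x size grid."""
    rng = random.Random(seed)
    cells = rng.sample(range(size * size), n_agents)
    return {agent: divmod(cell, size) for agent, cell in enumerate(cells)}


def acquaintances(positions, agent, horizon=4, size=16):
    """Agents within `horizon` cells up, down, left, or right of `agent`,
    with coordinates wrapping around the edges of the torus."""
    row, col = positions[agent]
    visible = set()
    for d in range(1, horizon + 1):
        visible.update({
            ((row + d) % size, col), ((row - d) % size, col),
            (row, (col + d) % size), (row, (col - d) % size),
        })
    return sorted(a for a, p in positions.items()
                  if a != agent and p in visible)


positions = place_agents(100)
print(len(acquaintances(positions, 0)))
```

Each agent can see at most 16 cells, so with 80 or 100 agents on 256 cells a typical agent knows only a handful of others, and some may know none.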

Two abstract types of water consumption events were defined: private
consumption events that are not visible to any but the consuming agent,
and public consumption events that are publicly visible. Each agent is
able to observe the public consumption events of all agents known to it.
Each agent decides how much water to use during each event, and the
frequency of use events is measured by the number of events per month.
This is also the information viewed by the agent's neighbors.

In addition, each agent can read the exhortatory policy pronouncements
of the policy agent. These pronouncements take the form of recommended
frequencies of each activity (public and private) as well as volume
consumed per use event. As indicated, such pronouncements are made only
during periods of drought, defined as months in which soil water is below
85% of capacity and has been below that level for at least two consecutive
months. The frequency and volume per event recommended by the policy
agent are both determined by the previous month's use and the current
shortfall of soil water below 85% of capacity. So if, for all domestic
households, the previous average frequency of a water-using activity were
f and the volume used per event were u and the current soil water
magnitude is 75 (capacity = 100), then the policy agent would recommend
that each household engage in the consumption activity on 0.75 x f
occasions (rounded down to the nearest integer) and that the volume used
per event be 0.75 x u. In every drought month when the soil water
diminishes, the frequency and volume demanded by the policy authority
are reduced according to the above formulae. The policy agent does not
change the demanded frequency or use per event when drought conditions
persist but the soil water increases. When the drought ends (because soil
water exceeds 85% of capacity) the policy agent simply ceases to offer any
exhortation.
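The policy agent's rule can be sketched as a small state machine. The class and attribute names below are hypothetical, and two details are assumptions rather than statements from the text: a month at exactly 85% of capacity is treated as not dry, and the first recommendation is issued in the second consecutive dry month.

```python
import math

DROUGHT_THRESHOLD = 85.0  # soil water as a percentage of capacity


class PolicyAgent:
    """Issues (frequency, volume) recommendations during droughts only."""

    def __init__(self):
        self.dry_months = 0         # consecutive months below the threshold
        self.prev_soil = None       # soil water in the previous month
        self.recommendation = None  # (frequency, volume per event) or None

    def step(self, soil_water, avg_freq, avg_volume):
        """Advance one month and return the current recommendation, if any."""
        if soil_water >= DROUGHT_THRESHOLD:
            # Drought over: the policy agent stops exhorting.
            self.dry_months = 0
            self.recommendation = None
        else:
            self.dry_months += 1
            if self.dry_months >= 2:
                falling = self.prev_soil is None or soil_water < self.prev_soil
                # Tighten only when soil water has diminished; otherwise hold.
                if self.recommendation is None or falling:
                    factor = soil_water / 100.0
                    self.recommendation = (
                        math.floor(factor * avg_freq),
                        factor * avg_volume,
                    )
        self.prev_soil = soil_water
        return self.recommendation
```

With soil water at 75, an average frequency of 10, and an average volume of 50 per event, the recommendation is 7 events at 37.5 units each, as in the worked example in the text.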

This behavior by the policy agent captures at coarse grain the behavior
actually observed with regard to the authorities' public pronouncements
and statutory restrictions in the United Kingdom during periods of
drought.


Agent Cognition 


The cognitive processes of the agents were specified to capture observed
patterns of demand for goods and services by representing social
interactions that have been well documented by observation and experiment
by social psychologists. This was done in a manner that also reflects the
fact that individuals change their behavior in response to a significant
weight of evidence and social pressure, but not in response to small
changes in price, incomes, or any other variable.

The representation of the decision-making process is the endorsements
mechanism described above. Two types of objects were endorsed by
household agents in the CC:DEW model: actions and other household
agents. The actions available to the household agents are water-use actions
consisting of use frequencies and consumption per use event. The other
household agents are, of course, those that they can see.

There is an endorsement scheme for each type of object to be endorsed.
So each household has one endorsement scheme for water-use actions and
one for visible household agents. For all households, the tokens with which
actions and agents, respectively, can be endorsed are identical.

The endorsements on actions are globally sourced, neighborhood
sourced, self-sourced, or best endorsed neighbor sourced. The first three
are straightforward. Globally sourced actions are actions that the policy
agent is recommending publicly. Neighborhood sourced actions are actions
that are observed to be taken by visible neighbors, while self-sourced
actions are actions preferred by the individual independently of any
external influences. The fourth endorsement, best endorsed neighbor
sourced, is accorded to the action observed to be taken by the neighbor
that is most highly valued among all of the visible neighbors of the agent.
That neighbor is the one most like the endorsing agent and is determined
by the agent's endorsements scheme.

The endorsements on other agents are closest activity consumption,
closest activity frequency, or closest activity frequency and consumption.
The idea here is that other agents can be valued because their consumption
per use event is closer to one's own than the consumption per use event
of any other agent, or because the frequency with which an agent engages
in an activity is closest to one's own frequency of engaging in that activity.
There is an added appreciation of any agent whose consumption is most
like one's own in both frequency of events and consumption per event.
The motivation for these endorsements is the common finding by social
psychologists that similarity of attitudes and mutual (not necessarily
romantic) attractiveness are highly correlated. That is, individuals tend to
share the attitudes of those they like and to like those whose attitudes
they share. The attitudes that lead to similar consumption patterns are
taken here to induce personal attraction, which in turn reinforces the
similarity of attitudes. In this model, attitudes toward authority and
toward social norms determine patterns of water consumption.

It is, of course, essential to be able to evaluate which actions and which
neighbors are the most attractive. This requires some means of comparing
different collections of endorsement tokens. Cohen's original endorsements
approach allocated endorsement tokens to classes of importance.
The action chosen would be that which had the most endorsements of
the highest class, or, if several had the same number in the highest class,
the action that was tied in the highest class but had the most endorsements
in the second-highest class. If there was another tie at the second-highest
class of endorsements, the third, or, if necessary, the fourth or lower class
would be used to break the tie.

A more general approach, and that used here, is to define a number
base b and to evaluate each endorsed object according to the formula

$$
V = \sum_{e_i \ge 0} b^{e_i} - \sum_{e_i < 0} b^{|e_i|},
$$

where e_i is a (usually integer) value associated with the ith endorsement
token. Negative values of endorsement tokens indicate, naturally enough,
that they are undesirable. The higher the value associated with an
endorsement token, the higher the class of tokens containing that particular
token. The value of b is the importance of an endorsement token relative
to the value of a token in the class below. If the base is two, then an
endorsement of class 3 contributes 8 (= 2³) to the endorsement value of
an object while an endorsement of class 2 contributes only 4 (= 2²). For
values of b larger than the number of tokens in any class used to endorse
any object, the results from this evaluation scheme are the same as from
Cohen's evaluation scheme. For smaller values of b it is possible for a
large number of lesser endorsements to outweigh a small number of
endorsements of greater value.

An agent might have an actions endorsement scheme in which
self-sourced is an endorsement of class 3, neighborhood sourced and best
endorsed neighbor sourced are endorsements of class 2, and globally
sourced is an endorsement of class 1. If the value of the numerical base
of the endorsement scheme (the value of b) were 1.5, then an action
endorsed as both neighborhood sourced and best endorsed neighbor
sourced would have an endorsement value of 1.5² + 1.5² = 2.25 +
2.25 = 4.5. An action endorsed as self-sourced would have an endorsement
value of 1.5³ = 3.375. The first action would have the higher
endorsement value and would therefore be the action executed by the agent.
If, however, the numerical base were two, the actions would be equally
well endorsed since the endorsement in class 3 would have been 2³ = 8,
and the two class 2 endorsements would give an endorsement value of
2² + 2² = 8. For any base greater than two, the single endorsement of
class 3 would outweigh the two endorsements of class 2, since b³ > 2b²
whenever b > 2.
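The evaluation formula is simple enough to check numerically. The following is a minimal sketch, with a hypothetical function name, in which each endorsement is represented only by its signed class value.

```python
def endorsement_value(classes, base):
    """Sum base**e over non-negative classes, minus base**|e| over negative ones."""
    positive = sum(base ** e for e in classes if e >= 0)
    negative = sum(base ** abs(e) for e in classes if e < 0)
    return positive - negative


# Base 1.5: two class 2 endorsements against one class 3 endorsement.
print(endorsement_value([2, 2], 1.5))  # 4.5
print(endorsement_value([3], 1.5))     # 3.375

# Base 2: the same two collections of endorsements are tied.
print(endorsement_value([2, 2], 2))    # 8
print(endorsement_value([3], 2))       # 8
```

An agent would then execute the action with the largest endorsement value; how ties are broken is not specified in the text and is left open here.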

For each agent, the value of each endorsement token was set at random
as was the numerical base of the endorsement scheme. The desired
consequence of this approach was to allow for heterogeneity among agents
as well as different agent populations in each simulation run. Some agents
would be most influenced by external authority, some by their neighbors,
and some would be mainly self-directed. The probability that an agent
would fall into any one of these categories was varied in order to give an
indication of the importance and effects of social influence. The results
of different proportions of primarily socially influenced agents are reported
below.


The General Setup 


The water consumption model focuses upon the behavior of households,
in particular, how the household-to-household behavioral influence affects
the aggregate demand for water. Thus the heart of the model is a network
of agents, each of which represents a single household. These are
distributed randomly on a two-dimensional grid. These “households” can only
interact with those within a certain distance of them vertically and
horizontally (their “neighbors”). The totality of households and their
potential interactions can be considered to represent a community or cluster.

This setup assumes that individuals tend to form stable social
relationships with persons with whom they already have common social
backgrounds and interests, and, once such relationships are formed, the
opinions and behavior of these individuals tend to be similar. In order to
capture this finding, agents are designed to endorse as most similar to
themselves those neighbors whose water consumption behavior is most
like their own. Such neighbors are then of particular importance in terms
of influence. The network of all the possible avenues of influence by such
endorsed neighbors is shown in figure 5. The total web of possible
influences among households is not shown as it is too dense to be displayed
sensibly. This structure was chosen to be consistent with what domain
experts told us about influence between households. The result is that
agents are given a nonuniform local network of relationships within which
each household determines the neighbors by which it is most influenced.

As an example of how this influence is captured by the endorsements
mechanism, we follow the selection of a particular action from the first
month of its adoption until it was replaced six months later. A sequence
of endorsements and action choices is reported in table 1. It shows how
an action was reinforced by a combination of the endorsements recent,
neighbor sourced, and self-sourced (remembered, but not necessarily
recent), until action 8,472 eventually overtakes it by being neighbor sourced
four times, including being endorsed by the “most alike neighbor.” How
many neighbor sourced endorsements are necessary to “overcome”
endorsements such as self-sourced and recent depends upon the weightings
the agent is given during the model initialization.

FIG. 5. Influence network of “most similar” households at a typical instant
of simulation time (legend: Global Biased, Locally Biased, Self Biased)

As a result of the learning and decision making by households, a
self-reinforcing household-to-household imitation pattern can occur. If the
households are (on the whole) sufficiently biased toward imitating
neighbors, then each household in a cluster may copy a substantial part of its
behavior from those neighbors who have copied the behavior from their
neighbors, and so on. If the households are sufficiently clustered, then
patterns of behavior may be copied back and forth, thus reinforcing
themselves. Thus a sort of competition between different patterns of behavior
and the “locking in” of winning behavior can result.

TABLE 1
EFFECT OF ENDORSEMENT ON ACTION CHOICE

Month 1   Used, endorsed as self-sourced
Month 2   Endorsed as recent (from personal use) and neighbor sourced
          (used by agent 27) and self-sourced (remembered)
Month 3   Endorsed as recent (from personal use) and neighbor sourced
          (agent 27 in month 2)
Month 4   Endorsed as neighbor sourced twice (used by agents 26 and 27
          in month 3), also recent
Month 5   Endorsed as neighbor sourced (agent 26 in month 4), also recent
Month 6   Endorsed as neighbor sourced (agent 26 in month 5)
Month 7   Replaced by action 8,472 (appeared in month 5 as neighbor
          sourced, now endorsed four times, including by the most alike
          neighbor, agent 50)

The external environment for each household consists of the temperature
and precipitation, the exhortations of the policy agent (which used
to be a government agency and is now the local water company), and,
critically, the neighboring households. Each household has a number of
different water-using devices such as showers, washing machines, and
hoses. The distribution and properties of these devices among agents
match the distribution of these appliances among households as obtained
in a recent survey. The numerical outputs include the volume of
water used by each household in each appliance, from which we calculate
total domestic water consumption and the time patterns of the effects of
new technologies on water consumption.

The domestic water consumption model captures hydrological
phenomena as well as processes of social influence that are believed to affect
the domestic demand for water. The hydrological element determined the
addition to the water supply each month given actual temperature and
precipitation as well as the month (since hours of daylight influence water
transpiration from plants and animals and evaporation into the
atmosphere). Since our temperature and precipitation data were gathered
monthly, the month was the level of time step used to calculate
consumption.

Each month, each household adjusts its water-using habits in terms of
the frequency with which it uses each device, and whether it acquires
new devices (such as power showers). The household makes this
adjustment based on the following: what devices it has, its existing habits, what
its neighbors do (except in the case of private devices such as toilets), and
what the water company may be suggesting (in times of drought). The
weighting that each household uses for each of these factors is different
and is set by the modeler. In many of the runs it was set such that about
55% of the households were biased toward imitating a neighbor, 15%
were predisposed to listen to the water company, and the rest were largely
immune to outside suggestion. Actual proportions in real communities are
unknown, but anecdotal accounts suggest they vary greatly between
communities. While there are undoubtedly too many degrees of freedom in
the model to infer from it any particular distribution of openness to
influence across households, the model does indicate that how definite its
results are depends on that distribution.
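One simple way to draw such a population is sketched below. It is an illustration only: the function name, the category labels, and the use of independent draws per household are assumptions, and in the model itself a household's bias follows from its randomly set endorsement values rather than from a label.

```python
import random


def assign_biases(n_households, p_neighbour=0.55, p_policy=0.15, seed=0):
    """Draw a dominant source of influence for each household.

    The remaining probability mass goes to largely self-directed households.
    """
    rng = random.Random(seed)
    labels = ["neighbour", "policy", "self"]
    weights = [p_neighbour, p_policy, 1.0 - p_neighbour - p_policy]
    return rng.choices(labels, weights=weights, k=n_households)


population = assign_biases(100)
share = population.count("neighbour") / len(population)
print(f"share of neighbour-biased households: {share:.2f}")
```

Because each household is drawn independently, the realized shares in any one run vary around the nominal 55%, 15%, and 30%.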

We note that, in the model, social embeddedness amounts to behavior
that is the consequence of social interaction with agents influencing one
another. The higher the proportion of agents whose behavior is influenced
primarily by neighbors like themselves, the greater the degree of social
embeddedness represented in the model.

In order to assess the effects of social embeddedness on aggregate
behavior, we varied, across simulation runs, the probability that an agent's
endorsement scheme gives the highest value to neighbor sourced. The
results are exhibited in figure 6, where it is seen that for a relatively low
degree of social embeddedness (30% of households are influenced more
by neighbors than by authority or by anyone else), there is a wide scatter
of demand series for households, while for higher degrees of social
embeddedness (55% and 80% primarily neighbor influenced), the scaled time
series of demands are much more closely clustered. This result suggests
that the emergence of social norms may be explained by the importance
of social embeddedness as a determinant of individual behavior. While
not surprising, the result does tend to confirm that agent-based social
simulation provides a means of relating individual behavior to macrolevel
social properties. The result also indicates that, beyond some minimum
degree of social embeddedness, the simulation results from the model are
not sensitive to particular parameter settings in the endorsement
mechanism.

The “policy agent” represents the body responsible for issuing guidance
to consumers as to water use in times of water shortage (currently the
policy agent is the individual water company in each area). In the model
there is a calculation of the level of ground water derived from the
climatological data, and the policy agent starts issuing suggestions during
the second month when the ground is dry. In subsequent dry months, its
suggestion is to use increasingly less water. The model structure is
illustrated in figure 7.

The model does not attempt to capture all the influences upon water
consumption. In particular, it does not include any direct influence of the
weather upon microcomponent usage, nor does it include any inherent
biases toward increased usage due to background social norms such as
increased cleanliness. The behavior of the policy agent is not sophisticated
since it is the reaction of the households that is important here.

FIG. 6. Household demands scaled to average = 100 for different degrees
of social influe [...] from an independent run of the model)

FIG. 7. The structure of the model of domestic water demand (diagram
labels: Activity, Frequency, Volume, Temperature, Rainfall, Sunshine,
Aggregate Demand)

The time series from a typical run is shown in figure 8. It exhibits
clustered volatility similar to that seen in the water usage data shown in
figure 2. The histogram of the relative changes is shown in figure 9; this
shows less leptokurtosis than was seen in the corresponding histogram
for the water usage shown in figure 3. However, as shown in figure 10,
increasing aggregation of days up to 32 clearly reshapes the histogram of
relative changes toward the shape and extent of leptokurtosis of the
simulated monthly (average 30-day) output from the agent-based simulation
model. Since we know that one means of generating leptokurtic data with
clustered volatility is by specifying models with strong, dissipative
interaction among metastable elements, and since the model that describes
regulator and water supply company perceptions of the behavior of
households conforms to those conditions, we can state that those perceptions
are consistent with the data. We can also state that because of that
consistency between perceptions and data, they have no warrant for planning
future water supply capacity on the basis of regression models of domestic
water demand. This result exactly parallels the discussion of the
representative articles from the AJS.


Descriptive Statistics vs. Statistical Models


The presumption that statistics is “science” but qualitative research is
“mere anecdote” is at variance with our experience of participatory
agent-based modeling. Our agent-based models often produce time-series data
that are characterized by clustered volatility and high levels of
leptokurtosis. This result does not occur because we tune our models to produce
these kinds of time series but rather seems to be a consequence of the
characteristics we put into our models, namely social embeddedness, the
prevalence of social norms, and individual behavior. The reason we often
make models with these characteristics is that these are a necessary part
of making our models consistent with the observations of sociologists and
other domain experts concerning the behavior of the actors concerned.
The fact that the time series derived from social phenomena often have
these same characteristics may indicate that the models are on the right
track.

FIG. 8. Simulated relative change in monthly consumption

Our approach is in sharp contrast to many statistical approaches, where
it is assumed that conventional statistical models and tools will apply.
These assumptions are often completely unwarranted and seem to be
made purely because of a perceived lack of choice. The burden of proof
should be on those who wish to make these assumptions; otherwise they
will be merely “muddying” the debate with statistical artifacts by
presenting “results” that do not fundamentally derive from the social
phenomena with which we are concerned. It is not possible to prove that
such assumptions are wrong: it is always logically possible that there is
a well-defined, fixed distribution function underlying the time series, even
when all the indications of available data are to the contrary. One can
always assume that such a distribution can be recovered by means such
as increasing the sample size, considering a longer series, and excluding
“exceptional” events.

FIG. 9. Relative changes in simulated monthly water consumption

The time series that result from social phenomena and the agent-based
models of these phenomena provide some evidence against these sorts of
assumptions. Agent-based models that seek to be consistent with the
anecdotal evidence of how social actors behave often produce time series
that are best described by power laws, Pareto distributions, and the like.
The extreme “peaks” observable in the time series they produce do not
always result from substantial or exceptional factors but are intrinsic
products of the processes captured by the models. This adds credibility
to the hypothesis that similar processes could be responsible for the similar
time series derived from social phenomena.

If one insists on assuming the time series obtained from social
phenomena can not only be described by fixed distributions in a post hoc
manner but are also generated by them (in some underlying or a priori
way), then the high levels of leptokurtosis and clustered volatility do not
support an assumption that such distributions as the normal or binomial
are applicable, but rather fit those like the Pareto distribution. With the
Pareto distribution, high levels of leptokurtosis indicate that the second
moment (the variance) is undefined; in other words, as you take
longer samples, the variance will not tend to a limiting, “stable” value.
In fact, in over half of its parameter space, the Pareto distribution does
not even have a defined mean.

FIG. 10. The frequency distributions of the relative changes, grouped in
sizes 2, 8, and 3 [...] 09, mean = 01. For group size 8, scale is 0-300,
SD = 03, mean = 005. For group size [...] (panels: Group size 2, Group
size 8)

If we were to suppose that a particular social process generates data
described by a Pareto distribution function, then the prevalence of
leptokurtosis implies that, in general, that distribution function has no defined
variance and very likely has no defined mean. That is, increasing the
number of observations will not result in the convergence of an undefined
mean or variance to any fixed population value.
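For reference, the dependence of the Pareto moments on the tail parameter can be written down directly. The sketch below assumes the standard Pareto (type I) parameterization with tail index alpha and minimum value x_min, neither of which is named in the text, and it represents a moment that has no finite value by infinity.

```python
import math


def pareto_moments(alpha, x_min=1.0):
    """Return (mean, variance) of a Pareto type I distribution.

    The mean is finite only for alpha > 1 and the variance only for
    alpha > 2; math.inf marks a moment with no finite value.
    """
    if alpha > 1.0:
        mean = alpha * x_min / (alpha - 1.0)
    else:
        mean = math.inf
    if alpha > 2.0:
        variance = x_min ** 2 * alpha / ((alpha - 1.0) ** 2 * (alpha - 2.0))
    else:
        variance = math.inf
    return mean, variance


for alpha in (0.8, 1.5, 3.0):
    print(alpha, pareto_moments(alpha))
```

Under this parameterization a sample mean or variance computed from data with a tail index at or below the relevant threshold has no population value to converge to, however many observations are added.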

Considered as a data-generating process, social institutions in this case
would not produce numerical data that can be treated as if they were
drawn from an underlying population distribution. We would not expect
social environments marked by clustered and unpredictable volatility to
generate data described by a fixed frequency distribution. This is because
social institutions frequently respond to episodes of volatility by changing
the behavior of the institution. A clear example is the changes in the
norms and rules governing trading in organized financial markets. These
rules are typically changed after a serious downturn, as happened after
the financial panics of the 19th century (after which stocks were issued
fully paid up), certainly after the 1929 crash (when the value of debt that
stockbrokers could lend customers against their portfolio values was
limited by law in the United States), and, more recently, after the 1987
downturn in world stock markets (when trading pauses were introduced
in conditions where automated trading appeared to have become
unstable). Other examples include corporate reorganizations, consequences
of technical change, and changes in political regimes (e.g., the French
Revolution or the “revolutions” in Eastern Europe in 1989).

There is no evidence that the variance of daily domestic water
consumption is settling down to a defined value, as can be seen in figure
11.* This figure is similar to the (lack of) convergence found by Mandelbrot
(1963) in financial time series.
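The kind of diagnostic behind such a figure can be sketched as follows. The text does not spell out the exact measure, so this is one plausible reading, stated as an assumption: the variance of the first n log changes, recomputed as n grows. The function names are hypothetical.

```python
import math


def log_changes(series):
    """Log of the ratio of each observation to the one before it."""
    return [math.log(b / a) for a, b in zip(series, series[1:])]


def cumulative_variance(values):
    """Population variance of the first n values, for n = 1, 2, ..., len(values).

    Uses Welford's running update, which avoids recomputing from scratch.
    """
    out = []
    mean = 0.0
    m2 = 0.0  # running sum of squared deviations from the current mean
    for n, v in enumerate(values, start=1):
        delta = v - mean
        mean += delta / n
        m2 += delta * (v - mean)
        out.append(m2 / n)
    return out
```

If the data came from a distribution with a finite variance, the resulting sequence would be expected to settle down as n increases; persistent jumps are the (lack of) convergence referred to above.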

* The measure of cumulative variance used here is taken from Mandelbrot (1963).

FIG. 11. The cumulative variance of log changes in daily domestic water
consumption (liters per day) in a metered UK neighborhood (derived from
the figures illustrated in fig. 1)

It is likely that much of the apparent normality of some socially derived
time series is an artifact of the way they are produced. As we have seen,
a consequence of the central limit theorem is that any averaging in the
construction of the data (either implicit as in summing over fixed time
periods or explicit) will tend to make the data appear more normal. For
example, if daily time series have a high descriptive leptokurtosis, then
the monthly series will have a much lower level of leptokurtosis. To
illustrate this we took the water consumption data illustrated in figure 2
and averaged them in consecutive groups (e.g., the first four numbers, the
second four, etc.) and then constructed frequency distributions of the
resulting figures. Three of these distributions were illustrated in figure
10. The distribution becomes clearly less leptokurtic as the grouping size
increases. The kurtosis for different levels of averaging is shown in
table 2.
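The grouping procedure can be sketched in Python as follows. The function names are hypothetical, and the kurtosis used is the ordinary (Pearson) fourth standardized moment, which equals 3 for a normal distribution; the text does not say which definition was used, so this is an assumption.

```python
def group_means(series, size):
    """Average consecutive, non-overlapping groups of `size` observations.

    A final incomplete group is dropped.
    """
    n_groups = len(series) // size
    return [sum(series[i * size:(i + 1) * size]) / size for i in range(n_groups)]


def relative_changes(series):
    """Change from one observation to the next, relative to the earlier one."""
    return [(b - a) / a for a, b in zip(series, series[1:])]


def kurtosis(values):
    """Pearson kurtosis: fourth central moment over squared variance."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m4 = sum((v - mean) ** 4 for v in values) / n
    return m4 / m2 ** 2


def kurtosis_by_group_size(daily_series, sizes=(1, 2, 4, 8, 16, 32)):
    """Kurtosis of relative changes after averaging in groups of each size."""
    return {s: kurtosis(relative_changes(group_means(daily_series, s))) for s in sizes}
```

Applied to a daily consumption series, `kurtosis_by_group_size` returns one kurtosis value per group size, which is the shape of the comparison reported for the metered data.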

This result serves to illustrate the difference between descriptive
statistics and statistical models of the underlying process. The former is a
numerical description of some data while the latter is a model of the
data-generating process; conflating the two can be misleading.

The lack of an underlying fixed distribution behind the phenomena
indicates a capacity to generate dynamic and/or structural change
endogenously. This change results, in part, from the ability of a system to
“rewire” itself into new configurations. It seems plausible that this ability
is limited by the complexity of the system. Although agent-based models
are considerably more complex than analytic models, they are still formal
systems, and hence, much simpler than the social processes they seek to
capture. The simplistic nature of the influence mechanisms in the water
demand model and its limited size (40 or 100 agents, depending on the
purpose of the experiment) are probably not sufficient to bring about such
structural changes endogenously.

TABLE 2
KURTOSIS OF THE FREQUENCY DISTRIBUTION OF RELATIVE CHANGES OF WATER
CONSUMPTION AFTER BEING BINNED INTO DIFFERENT SIZE GROUPS

                              GROUP SIZE
              1         2        4       8       16      32
Kurtosis   2,490.0    130.1    82.5    39.5    33.5    15.7

It seems essential to us that in good (social) science, observation of how
processes actually occur should take precedence over assumptions about
the aggregate nature of the time series that they produce. That is,
generalization and abstraction are only warranted by an ability to capture
the evidence. Simply conflating descriptive statistics with a (statistical)
model of the underlying processes does not render the result more scientific
but simply more quantitative.

In the end, this discussion comes down to deep assumptions about the
nature of the phenomena with which one is concerned, especially about
the characteristics of what might be considered as “noise.” In any model
one will only be able to capture certain limited aspects of what one is
modeling; the rest can be thought of as “noise.” However, one cannot
assume that this noise is random (in a statistical sense) just because it is
unpredictable. One cannot even assume that this noise will obey the “law
of large numbers,” which, broadly stated, is the property that random
noise will cancel out faster than any “signal” as sample size increases.
Models that illustrate this possibility include Brian Arthur's “El Farol
Bar” model (Arthur 1994) as investigated by Edmonds (1999b). What we
do not understand about social phenomena, even through the lens of
aggregate time series, cannot be dealt with as simply as an engineer might
treat meaningless electrical fluctuations.


CONCLUSION 


The water consumption model presented in this article represents an
attempt to describe a social process that is consistent with both the
qualitative data provided by stakeholders and other domain experts and the
observed characteristics of time-series data concerning domestic water
consumption. Each successive version of the model has been subject to
validation by domain experts. Their validation concerned both the
representation of agent behavior and interaction as well as characteristics of
the aggregate time-series output (especially the recovery of demand after
drought-induced restrictions have been rescinded).

This approach differs fundamentally from the usual approaches to
statistical research and extends the usual approaches to qualitative research.
Both of these features of our approach depend crucially on the use of
agent-based social simulation models.

Although agent-based modeling is in widespread use, the agents are
not universally implemented as descriptions of the behavior of observed
social entities. It is not uncommon, for example, for agents to be specified
as genetic algorithms (Chen and Yeh 2002; LeBaron, Arthur, and Palmer
1999; Palmer et al. 1993) or as artificial neural networks (LeBaron 2002)
or as players in a game-theoretic setting (Macy and Sato 2002).* While
these agent designs are sometimes implemented in simulation models
addressing issues of concern to sociologists such as trust or social norms,
the agents themselves cannot be compared directly with the behavior of
the individuals or composite social entities they are intended to represent.
Such designs do not facilitate validation of the agents using qualitative
data and qualitative research methods. Particularly in the case of models
of financial markets, such agent-based models do generate leptokurtic
time-series data due to clustered volatility. Typically, these models satisfy
the four conditions of agent metastability, interaction, social influence,
and being slowly driven. We do not know whether all such models will
generate leptokurtosis and volatility, but we do know that they commonly
do.†

What these models show is that the observed features of aggregate
time-series data for financial markets, macroeconomies, and markets for
fast-moving consumer goods can be the result of data-generating processes
in which there is no individual maximizing behavior and in which
interaction among individuals is crucial. However, the TVP models
discussed above also produce aggregate time-series data with the same
observed properties. And these models are intended to be consistent with
the economic rational expectations hypothesis based on the presumption
that individuals do maximize utility and that they are not socially
embedded. Although the TVP models, unlike the agent-based models, have
no formally elaborated microfoundation, they are neither more nor less
well validated than the agent-based models implementing agents as
genetic algorithms, artificial neural networks, or game-theoretic strategies.

We have shown that descriptive statistics are insufficient to identify
the processes that generated the data. We have also shown that, provided
the data are sufficiently fine grained, some models of data-generating
processes can be shown to be consistent with that data and some can be
excluded. But choosing among the classes of models that purport to
describe the underlying data-generating process requires some additional
discriminants. To avoid mere tautology, such discriminants must be
empirically based. Additional statistical data are of no help since we
have already shown that such data cannot be used to discriminate among
the possible data-generating processes. If we are not to use statistical
data for this purpose, we must use qualitative data or at least data
describing microbehavior. In designing and developing our domestic water
consumption model, we used qualitative data and assessments provided by
domain experts, and, in particular, by stakeholders in the water
resource management process. As a result, the models were assessed
independently in relation to the macrolevel statistical data and the
microlevel qualitative data—the process we have called cross-validation.

* Macy (1991) reviews the earlier literature in this area.

? Jensen (1998) points out that there are only a few, rather special
analytical results on systems of this type. Consequently we can only
conjecture about the generality of the relationship between these four
conditions and unpredictable, clustered volatility and, hence,
leptokurtosis.

We believe it to be both interesting and important that the process of
validating the model qualitatively at the microlevel is clear and
straightforward, while the validation using macrolevel statistics is
suggestive and based on visual impressions of clustered volatility. This
experience stands on its head the notion that qualitative research is
mere anecdote while statistics is science.

Our aim has been to demonstrate that agent-based models have the
particular strength that they can be validated with respect to both
qualitative and statistical data and at both micro- and macrolevels. In
so doing, we have identified a number of further issues that need to be
addressed.

We have reported that some of our models generating leptokurtosis and
clustered volatility do and some do not produce time-series data with
apparently convergent moments of the frequency distributions of those
data. The differences in the models that lead to these different results
might well be important, but we have no idea of what causes those
differences. In particular, we do not know whether these differences are
artifacts of the model or something the model represents about social
relations and individual behavior. We simply point out that if the
distributions are stable, then a nonconvergent variance is consistent
with a Levy distribution and a convergent variance is not. This result
will be important to those who investigate empirical distributions of
personal incomes, firm sizes, market shares, city sizes, or any of the
other power-law distributed, cross-sectional, and time-series economic
data, since power-law distributions are a consequence of the Levy
distribution.

A second problem is that the notion of the “statistical signature” is
without formal content. If we are to use the features of social
statistical data to identify classes of possible models of
social-data-generating processes, then it would clearly be useful if we
could relate formal descriptors of the data to features of suitable
data-generating processes.

We have also pointed out that the common practice in social simulation
of producing a series of runs and then reporting summary statistics such
as median values can hide important episodes of volatility. Yet the
reason for running suites of simulation experiments—to identify robust
properties of the models—is surely valid. There is a tension here that
needs to be addressed.

Finally, we do not claim to be experts in qualitative research. We are,
however, aware that some social researchers are uncomfortable about
generalizing from the experiences of individual organizations and
institutions. We do not speculate as to whether there is any justice in
this position, but we do claim that more formal representations of
qualitative evidence by agent-based social simulation models provide a
means of identifying any general properties of social systems that are
consistent with independently observed macrolevel data.

A recursive analysis of network and institutional evolution is offered
to account for the decentralized structure of the commercial field of
the life sciences. Four alternative logics of attachment—accumulative
advantage, homophily, follow-the-trend, and multiconnectivity—are tested
to explain the structure and dynamics of interorganizational
collaboration in biotechnology. Using multiple novel methods, the
authors demonstrate how different rules for affiliation shape network
evolution. Commercialization strategies pursued by early corporate
entrants are supplanted by universities, research institutes, venture
capital, and small firms. As organizations increase their collaborative
activities and diversify their ties to others, cohesive subnetworks
form, characterized by multiple, independent pathways. These structural
components, in turn, condition the choices and opportunities available
to members of a field, thereby reinforcing an attachment logic based on
differential connections to diverse partners.


INTRODUCTION 


The images of field and network are common in both contemporary physical
and social science. In the physical sciences, fields are organized by
information in the form of geometric patterns. The study of the geometry
of fields has attracted considerable interest in the statistical
mechanics of complex networks. Research by physicists interested in
networks has ranged widely from the cellular level, a network of
chemicals connected by pathways of chemical reactions, to scientific
collaboration networks, linked by coauthorships and cocitations, to the
World Wide Web, an immense virtual network of websites connected by
hyperlinks (Albert, Jeong, and Barabási 1999; Jeong et al. 2000; Newman
2001; Watts and Strogatz 1998). Albert and Barabási (2002) and Newman
(2003) provide excellent overviews of this burgeoning literature on the
network topology of different fields, highlighting key organizing
principles that guide interactions among the component parts.

¹ We thank the Santa Fe Institute (SFI) for providing the venue where
these ideas were initially discussed and much of the work was done. We
are especially grateful to John Padgett, organizer of the states and
markets group at SFI, for his support and insight. We have benefited
from comments from the audience at seminars given at the

In the social sciences, however, analyses of fields and networks have
been oddly disconnected. We say oddly because the study of the
macrodynamics of networks should be central to the understanding of how
fields evolve. This lack of connection is rooted in several features of
contemporary research. An abundance of research in network analysis
examines why ties form between two actors and the consequences of
particular network positions. Salancik (1995) observed, however, that
most network research has taken an individual-level perspective and
missed out on the opportunity to illuminate the structure of collective
action. McPherson et al. (2001) note that there are few studies that
employ longitudinal data to analyze networks. Burt (2000) has voiced a
similar concern that most studies of network structure are
cross-sectional. In the most comprehensive text on network methods,
there is only a paragraph on network dynamics in a section on future
directions (Wasserman and Faust 1994). Thus while some progress has been
made analyzing the dynamics of dyads (e.g., Lincoln et al. 1996; Gulati
and Gargiulo 1998; Stuart 1998), little attention has been given to the
evolution of entire networks.

There are a number of excellent studies of the structuring of specific
organizational fields (DiMaggio 1991; Thornton 1995; Dezalay and Garth
1996; Ferguson 1998; Scott et al. 2000; Hoffman 2001; Morrill and
Owen-Smith 2002). An organizational field is a community of
organizations that engage in common activities and are subject to
similar reputational and regulatory pressures (DiMaggio and Powell
1983). Such fields have been defined as “a network, or a configuration,
of relations between positions” (Bourdieu 1992) and as “centers of
debates in which competing interests negotiate over the interpretation
of key issues” (Hoffman 1999, p. 351). Fields emerge when social,
technological, or economic changes exert pressure on existing relations
and reconfigure models of action and social structures. But despite the
relational focus on how different actors and organizations constitute a
recognized arena of social and economic activity, studies of fields have
not analyzed the interactions of multiple, overlapping networks or the
regulated reproduction of network ties through time. This linkage
between network dynamics and the evolving structure of fields needs to
be made in order to make progress in explaining how the behavior of
actors or organizations of one kind influences the actions of
organizations of another kind.

The goal of this article is to account for the development and
elaboration of the commercial field of biotechnology by showing how the
formation, dissolution, and rewiring of network ties over a 12-year
period, from 1988 to 1999, have shaped the opportunity structure of the
field.² By mapping changing network configurations, we discern how
logics of attachment shift over time, and we chart multiple influences
on the varied participants in the field. Our effort is part of a more
general move in the social sciences to analyze momentum, sequences,
turning points, and path dependencies (see Abbott [2001] for an
overview). By linking network topology and field dynamics, we consider
social change not as an invariant process affecting all participants
equally, but as reverberations felt in different ways depending on an
organization's institutional status and location in the overall network
as that structure evolves over time. Our aim is to illuminate how
patterns of interaction emerge, take root, and transform, with
ramifications for all of the participants.

² We use the term field rather than industry or population
intentionally. Biotechnology is not a separate industrial sector with
well-defined boundaries. Universities, government labs, nonprofit
hospitals, and research institutes are a critical part of the field,
while on the commercial side, both established pharmaceutical firms and
dedicated biotechnology companies are involved in bringing new medicines
to market. Thus, field captures the diversity of organizations more
aptly than any other term.

We develop arguments concerning how the topology of a network and the
rules of attachment among its constituents guide the choice of partners
and shape the trajectory of the field. As organizations enter a field
and relationships deepen and expand, significant structural changes
occur. To analyze and understand these emergent network structures, we
use a triangulation of methods. We first analyze the expansion of the
network to see if the process is random or uniform. Prior research
suggests that as new organizations join the network, there will be an
attachment bias that increases the probability of linking to an
organization that already has ties (de Solla Price 1965, 1980; Barabási
20028). We go further and assess whether other attachment processes are
operative as well. We map the development of the field by drawing
network configurations to create a framework with which to view network
dynamics. We use the software package Pajek (de Nooy, Mrvar, and
Batagelj 2005) for the representation of network dynamics. Pajek allows
us to analyze the nearly 2,800 nodes in our sample and to identify
cohesive subsets such as multiconnected components (White and Harary
2001, pp. 12-14). We present a small selection of these network
visualizations to highlight both the evolving topology of the field and
the processes by which new ties and organizations are added. (The full
“movie,” with year-to-year representations of the topology and new
additions to the network, may be viewed in the online version of AJS.)
We then turn to a statistical examination of network formation and
dissolution and assess the effects of alternative mechanisms of
attachment. Using McFadden’s (1973, 1981) discrete choice model, a
variant of the conditional logit model, we test to see if the basis of
attraction is accumulative advantage, similarity, follow-the-trend, or
multiconnectivity.
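
In a conditional logit model, the probability that a chooser selects a particular alternative is the exponential of that alternative's linear index divided by the sum of the exponentials over all available alternatives. The sketch below illustrates only that formula. The covariate names (`partner_degree`, `similarity`), the coefficient values, and the three candidate partners are hypothetical and are not the specification estimated in this article.

```python
import math


def choice_probabilities(utilities):
    """Conditional logit: P(j) = exp(v_j) / sum over k of exp(v_k)."""
    top = max(utilities)  # subtract the max for numerical stability
    weights = [math.exp(v - top) for v in utilities]
    total = sum(weights)
    return [w / total for w in weights]


def utility(attributes, coefficients):
    """Linear index v_j: sum of coefficient times attribute value."""
    return sum(coefficients[name] * value for name, value in attributes.items())


# Hypothetical covariates and coefficients, for illustration only.
coefficients = {"partner_degree": 0.05, "similarity": 0.8}
candidates = [
    {"partner_degree": 40, "similarity": 0.1},  # well-connected, dissimilar
    {"partner_degree": 5, "similarity": 0.9},   # sparsely connected, similar
    {"partner_degree": 5, "similarity": 0.1},   # sparsely connected, dissimilar
]
probs = choice_probabilities([utility(c, coefficients) for c in candidates])
print([round(p, 3) for p in probs])
```

In this framing, each attachment logic corresponds to a covariate of the candidate partner, and the sign and size of its estimated coefficient indicate whether that logic raises the odds of being chosen.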


The Topology of Large-Scale Networks 


A variety of researchers in physics and sociology are studying the
structure of large-scale networks with the intuition that complex
adaptive systems evince organizing principles that are encoded in their
topology. Large-scale networks typically have characteristic signatures
of local structure, such as clustering, and a global structure, such as
average distance between nodes. Local and global characteristics of
networks help to define network topologies such as small worlds, which
are large networks with both local clustering and relatively short
global distance. Watts (1999) showed that adding only a handful of
remote links to a large network where the level of local clustering is
high (e.g., friends of friends are friends) is sufficient to create a
small world network. Watts and Strogatz (1998) helped to revitalize the
earlier line of research introduced by Milgram (1967) and developed by
White (1970). The wide appeal of the small world idea had been portrayed
in the arts, in John Guare’s play Six Degrees of Separation, and in the
popular Kevin Bacon game, where virtually every Hollywood actor is
linked through a few steps. Even a small proportion of randomly
distributed ties can knit together diverse clusters of nodes to produce
small world phenomena. Researchers have applied the small world concept
to a wide range of activities, including scientific collaborations
(Newman 2001), corporate board interlocks (Davis, Yoo, and Baker 2003),
national ownership networks (Kogut and Walker 2001), investment bank
syndicates (Baum, Shipilov, and Rowley 2003), and Broadway plays (Uzzi
and Spiro 2005).
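
The claim that a handful of remote links shortens global distance while leaving local clustering largely intact can be illustrated with a toy network. The sketch below is an assumption-laden illustration rather than a reproduction of any cited study: it builds a ring in which each node is tied to its four nearest neighbors, adds five random shortcuts, and compares average clustering and average path length. The network size, the number of shortcuts, and all function names are hypothetical choices.

```python
import random
from collections import deque


def ring_lattice(n, k):
    """Ring of n nodes, each tied to its k nearest neighbors (k even)."""
    adj = {i: set() for i in range(n)}
    for i in range(n):
        for step in range(1, k // 2 + 1):
            j = (i + step) % n
            adj[i].add(j)
            adj[j].add(i)
    return adj


def add_shortcuts(adj, count, seed=0):
    """Copy the network and add `count` ties between random unlinked pairs."""
    rng = random.Random(seed)
    new = {node: set(nbrs) for node, nbrs in adj.items()}
    nodes = sorted(new)
    added = 0
    while added < count:
        a, b = rng.sample(nodes, 2)
        if b not in new[a]:
            new[a].add(b)
            new[b].add(a)
            added += 1
    return new


def average_path_length(adj):
    """Mean shortest-path distance over reachable pairs (breadth-first search)."""
    total = pairs = 0
    for source in adj:
        dist = {source: 0}
        queue = deque([source])
        while queue:
            node = queue.popleft()
            for nbr in adj[node]:
                if nbr not in dist:
                    dist[nbr] = dist[node] + 1
                    queue.append(nbr)
        total += sum(dist.values())
        pairs += len(dist) - 1
    return total / pairs


def average_clustering(adj):
    """Mean share of a node's neighbor pairs that are themselves tied."""
    scores = []
    for nbrs in adj.values():
        k = len(nbrs)
        if k < 2:
            scores.append(0.0)
            continue
        links = sum(1 for a in nbrs for b in nbrs if a < b and b in adj[a])
        scores.append(2 * links / (k * (k - 1)))
    return sum(scores) / len(scores)


lattice = ring_lattice(60, 4)
small_world = add_shortcuts(lattice, count=5)
print(average_clustering(lattice), average_path_length(lattice))
print(average_clustering(small_world), average_path_length(small_world))
```

The shortcuts are added rather than rewired, which keeps every original tie in place; Watts and Strogatz's own procedure rewires existing ties, so this is a simplification.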

Watts and Strogatz’s (1998) formalization of the small world problem
lacked any role for network hubs—nodes with an unusually large number of
ties or edges. This limitation also held for early models of random
networks, in which an equal probability of any given pair of nodes being
connected generated only a mild tendency for nodes to differ in their
number of edges.³ Research on citation networks, however, has shown
highly skewed distributions, with most articles having few citations,
while a handful have an exceedingly large number. Lotka (1926) and de
Solla Price (1965, 1980) showed that in degree distributions of citation
networks, the proportion of nodes with degree $k$ varies as a function
of $1/k^{\alpha}$, that is, by the inverse power law
$P(k) \sim 1/k^{\alpha}$, where $\alpha$ is the power coefficient.
Barabási and Albert (1999, p. 510) and Barabási (2002a, p. 70)
popularized the term “scale free” for such networks,⁴ and they confirmed
that network growth with preferential attachment according to degree
predicts a scale-free tail of the degree distribution.⁵ The
well-connected nodes that newcomers attach to become hubs that create
short paths between many pairs of nodes in the network.

³ Even when edges form with equal probability for all pairs of nodes in
a network, there is considerable inequality in the distribution of the
number of edges for different nodes. The tail of the degree distribution
of a simple random process of tie-formation will be truncated, however,
by exponential decay in the number of nodes at successively higher
degree. The degree distribution of nodes in a highway network, e.g.,
tends to be exponential, more like a random network than the degree
distribution of nodes in today’s US airlines network, with hubs that
jump over many of the nodes connected by spokes. Note that these
differences are, in part, a matter of intentional design and not solely
a function of geography; there was a time when the airline network was
more like a highway network.

⁴ The meaning of scale invariant or scale free for a power law is that
the coefficient does not vary from scale to scale as magnitude varies
from 1 to 10 to 100, etc. Put differently, for a network with a
power-law tail to the degree distribution, there is no characteristic
number of edges per node, as in a bell-curve-shaped distribution or
exponential decay. Barabási's (20025) organizing motif is that networks
with power-law degree distributions have a characteristic scale-free
signature of self-organized systems. Analogy is made to the scaling of
the frequency of earthquakes in relation to their energetic intensity,
e.g., which tends to follow a power law. This would imply that there is
no typical scale for earthquakes and suggests that the physical
mechanism for large earthquakes is the same as that for the small ones.
The theme of “the same mechanism” has not been established for social
networks, however.

Preferential attachment to higher degree leads to a dynamic of
rich-get-richer. Power-law tails of degree distributions are present in
very diverse kinds of networks. In the movie actor network, for example,
new actors tend to start their careers in supporting roles accompanying
famous actors, and in science new publications cite well-known papers.
Attachment bias in network formation bears strong similarity to the more
general phenomenon of accumulative advantage (Merton 1973), in which
those who experience early success capture the lion’s share of
subsequent rewards. We use early starter accumulative advantage and
preferential attachment as baseline arguments, since they provide
potential explanations for growing inequalities in the process of
network expansion. But not all early entrants turn out to be winners,
and some latecomers attain prominence. As the saying goes, the early
bird may catch the worm, but it is the second mouse that eats the
cheese. Similarly, Albert and Barabási’s (2002) formalization of a
“scale-free” class of networks, in which the probability that a new
entrant will choose to link to an incumbent node is proportional to the
number of links the incumbent already has, is elegantly simple but
overgeneralized. Other attachment processes, or a combination of diverse
mechanisms, can produce power law degree distributions. Our analyses
test for multiple, probabilistic biases in the processes of network
growth.
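
The baseline rule, in which a newcomer links to an incumbent with probability proportional to the incumbent's current number of ties, is easy to simulate. The sketch below is a simplified illustration under stated assumptions, not the procedure used in this article: each newcomer adds exactly one tie, the network has 2,000 nodes, and the power-law coefficient is read off a least-squares line on a log-log plot of degree against frequency. All function and variable names are hypothetical.

```python
import math
import random
from collections import Counter


def grow_network(n_nodes, seed=0):
    """Grow a network one node at a time; each newcomer adds one tie,
    choosing an incumbent with probability proportional to its degree."""
    rng = random.Random(seed)
    degree = {0: 1, 1: 1}  # start from a single tie between two nodes
    endpoints = [0, 1]     # each node appears once per tie it holds
    for new in range(2, n_nodes):
        target = rng.choice(endpoints)  # uniform over tie ends = degree-proportional
        degree[target] += 1
        degree[new] = 1
        endpoints.extend([target, new])
    return degree


def loglog_slope(points):
    """Least-squares slope of log(y) on log(x) for (x, y) pairs."""
    xs = [math.log(x) for x, _ in points]
    ys = [math.log(y) for _, y in points]
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    return sxy / sxx


degree = grow_network(2000)
counts = Counter(degree.values())
frequency = [(d, c / len(degree)) for d, c in sorted(counts.items())]
alpha_hat = -loglog_slope(frequency)  # rough estimate of the power coefficient
print(max(degree.values()), round(alpha_hat, 2))
```

A slope fitted to raw frequencies is noisy in the sparse upper tail, so the estimate should be read as indicative only. As the paragraph above notes, a power-law tail does not by itself show that this particular rule generated the network.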

We enter the discussion of network dynamics with data from a field where
social, political, economic, and scientific factors loom large in
shaping patterns of attachment among the participants. In earlier work
on the biotechnology industry, Powell, Koput, and Smith-Doerr (1996)
found a liability of disconnectedness, in which older, less-linked
organizations were the most likely to fail. Certainly, early entrants
have more time than later arrivals to establish connections, but Powell
et al. (1996) found that how connections were established and what
activities were pursued were critical. Biotechnology firms had to both
make news and be in on the news; that is, they needed to generate novel
contributions to the evolving science while maintaining the capability
to evaluate what other organizations were doing. The pathway to
centrality in the industry network was through research and development
collaborations. Other routes were either ineffective or much slower in
generating centrality. Moreover, in a highly competitive world in which
it is not easy to rest on past accomplishments, firms that do not expand
or renew their networks lose their central positions. At the same time,
resource-rich participants are more capable of altering their positions
by reconfiguring their networks.

⁵ Here the actual attachment probability of new nodes with incumbents,
$P(k)$, is proportional to $k$, where $k$ is the degree of the
incumbent. The preferential attachment probability generates a degree
distribution in which the frequency of nodes with a given degree $d$ is
a function $f(d)$ of $1/d^{\alpha}$, where $\alpha$ is the power-law
coefficient and can be calculated from the slope of the linear
regression line on a log-log plot of $d$ and $f(d)$.

Biotechnology is characterized by a high rate of formation and
dissolution of linkages. Connections are often forged with a specific
goal in mind, such as taking a company public or selling and
distributing a new medicine. Once the task is completed, the
relationship is ended and successful collaborators depart gracefully.
There is a good deal of entry and exit into the field, with new entrants
joining at particular times when financing is available and novel
scientific opportunities can be pursued. The rate at which new nodes
appear in the network is, in part, determined by the success that
existing nodes have in making progress on a technological frontier.
Moreover, many of the participants in the field are “multivocal,” that
is, they are capable of performing multiple activities with a variety of
constituents (Burt 1992; Padgett and Ansell 1993; White 1985, 1992). But
multivocality is not distributed evenly; those organizations that are
more centrally located in the industry have access to more sophisticated
and diverse collaborators and have developed richer protocols for
collaboration (Powell et al. 1996).

To illustrate the questions we are pursuing, consider a contemporary
dance club, where revelers compete to get inside; once inside, they may
dance in groups or repeatedly with only one partner or serially with
many partners during an evening. The mix of available partners changes
as the evening goes on, and diverse styles of music are played in
different rooms. While new partners may be chosen, the imprint of past
choices often lingers. Some dancers may be highly sought after and some
music may attract more dancers. Or, by way of contrast, consider a more
formal setting, such as an early 20th century Swedish military ball,
where the young officers would ride in a horse-drawn carriage around
Stockholm in advance of the event with an official dance card and visit
the homes of young women to obtain permission to dance with them from
their parents. By the time of the dance, the aspiring young officers may
have filled their dance cards and rehearsed their repertoire of
conversation and dance.⁶ In either complex setting, an analytically rich
set of questions follows. Who dances which dance? With whom? When? To
address these questions, one needs information about the cast of
participants, the repertoire of activities performed, and the sequences
linking partners and activities. As the combinations of partners and
dances unfold, collective dynamics emerge. Individual choices may
cumulate into a cascade, resulting in everyone following similar
scripts. Or trends may cluster and find coherence only in small, densely
connected groups. Choices made early may strongly affect subsequent
opportunities, but path dependence might be offset by a constant flow of
new arrivals and departures. The challenge to understanding any such
highly interwoven system is to relate the behavior and dynamics of the
entire structure with the properties of its constituents and their
interactions. The aim is to discern what types of actors and
relationships are most critical in shaping the evolution of the field at
particular points in time.

⁶ We thank Örjan Solvell, Stockholm School of Economics, for the
detailed description of the status dynamics of the social world of his
grandfather, Otto Hammar, and for a photocopy of his grandfather's 1911
dance card.

We assess different sources of attachment bias and test to see if these
simple rules guide the process of partner selection, and if so, for
which participants and at what points in time. We supplement the idea of
accumulative advantage with alternative mechanisms that sociologists
have repeatedly found to be important in the formation of social and
economic ties and the evolution and replication of social structures.
The first alternative to early advantage or rich-get-richer is homophily
(McPherson and Smith-Lovin 1987), a process of social similarity
captured best in the phrase, “birds of a feather flock together.” A
second alternative is based on following the trend; thus the
participants observe others and attempt to match their actions to the
dominant behavior of the overall population (White 1981; DiMaggio and
Powell 1983). In this context, action is triggered by a sense of
necessity, by a desire to keep pace with others by acting appropriately
(March and Olsen 1989, pp. 160-62). This pattern may also arise from
participants reacting in similar ways to common exogenous factors.

We contrast arguments based on rich-get-richer processes with mechanisms
such as homophily and appropriateness, and propose a new model based on
multiconnectivity (the multiple linking of partners both directly and
through chains of intermediaries) and a preference for diversity. To
pursue the dance imagery, homophily suggests that, when you select a new
partner, he or she is someone with attributes similar to those of your
previous partners. A rich-get-richer process involves competing for the
most popular dancer. Following the trend entails choosing both a partner
and a dance that are comparable to the choices of most other
participants.

A preference for diversity, however, suggests a search for novelty and
the inclination to move in different communities and interact with
heterogeneous partners. Our ideas concerning multiconnectivity involve
several intuitions. A cohesive network, with plural pathways, means
participants are connected through different linkages. Thus many nodes
must be removed to disconnect such a structure, meaning that such
groupings are highly resilient. The larger the number of pathways for
communication and exchange, the more rapidly news percolates through the
network. In turn, when more knowledge is exchanged, participants attend
to their network partners more intensively (Powell 1990). The enhanced
flow of ideas and skills then becomes an attraction, making the network
more appealing to join. Rapid transmission and diverse participants
enhance both the likelihood of recombination and the generation of
novelty. In the language of organizational learning, diversity entails a
preference for exploration over exploitation (March 1991). Of course, we
do not necessarily expect that one mechanism dominates at all time
periods and exerts equal gravitational pull on every participant. The
very essence of dynamic systems is that they constantly change over
time. The actors may well play by different rules at different points in
time, depending on the experience of their partners and their position
in the social structure. Moreover, alternative organizing principles may
be dominant at different stages in the formation of the network. Framed
more formally, the alternative mechanisms can be stated in a series of
hypotheses.

HYPOTHESIS 1.—Network expansion occurs through a process in which the
most-connected nodes receive a disproportionate share of new ties
(accumulative advantage).

HYPOTHESIS 2.—Network expansion follows a process in which new partners
are chosen on the basis of their similarity to previous partners
(homophily).

HYPOTHESIS 3.—Network expansion entails herdlike behavior, with
participants matching their choices with the dominant choices of others,
either in mutual response to common exogenous pressures or through
imitative behavior (follow-the-trend).

HYPOTHESIS 4.—Network expansion reflects a choice of partners that
connect to one another through multiple independent paths, which
increases reachability and the diversity of actors that are reachable
(multiconnectivity).
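
The intuition behind multiconnectivity, that a group joined by multiple independent paths cannot be split by removing a single member, can be checked by brute force on small examples. The sketch below is a toy illustration and not the method used in this article, which relies on Pajek and on the cohesion measures of White and Harary (2001). The two four-node networks and the function names are hypothetical.

```python
def is_connected(adj, removed=frozenset()):
    """True if all nodes outside `removed` can reach one another."""
    nodes = [n for n in adj if n not in removed]
    if not nodes:
        return True
    seen = {nodes[0]}
    stack = [nodes[0]]
    while stack:
        node = stack.pop()
        for nbr in adj[node]:
            if nbr not in removed and nbr not in seen:
                seen.add(nbr)
                stack.append(nbr)
    return len(seen) == len(nodes)


def survives_any_single_removal(adj):
    """True if the network stays connected whichever one node is removed."""
    return is_connected(adj) and all(
        is_connected(adj, frozenset([n])) for n in adj
    )


# Toy networks: a chain A-B-C-D, and a ring that adds the tie D-A.
chain = {"A": {"B"}, "B": {"A", "C"}, "C": {"B", "D"}, "D": {"C"}}
ring = {"A": {"B", "D"}, "B": {"A", "C"}, "C": {"B", "D"}, "D": {"C", "A"}}

print(survives_any_single_removal(chain))  # False: removing B or C splits it
print(survives_any_single_removal(ring))   # True: two independent paths join every pair
```

By Menger's theorem, the smallest number of nodes whose removal separates two nonadjacent nodes equals the number of node-independent paths between them, which is why the single added tie in the ring changes the result.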


Field Structuration: Science Meets Commerce


Our empirical focus is on the commercial field of biotechnology, which
developed scientifically in university labs in the 1970s, saw the
founding of hundreds of small science-based firms in the 1980s, and
matured in the 1990s with the release of dozens of new medicines. This
field is notable for its scientific and commercial advances and its
diverse cast of organizations, including universities, public research
organizations, venture capital firms, large multinational pharmaceutical
corporations, and dedicated biotech firms (which we refer to as DBFs).
Because the sources of scientific leadership are widely dispersed and
rapidly developing and the relevant skills and resources needed to
produce new medicines are broadly distributed, the participants in the
field have found collaboration essential. The evolving structure of
these collaborative ties is the focus of our network study. Concomitant
with changes in the network, an elaborate system of private governance
has evolved to orchestrate these interorganizational relationships
(Powell 1996), and the internal structure of organizations has changed
accordingly, coevolving with the collaborative network.

In the early years of the industry, from 1975 to 1987, most DBFs were
very small start-ups that relied deeply on external support. No DBF in
this period had the necessary skills or resources to bring a new
medicine to market; thus they became involved in an elaborate
lattice-like structure of relationships with universities and large
multinational firms (Powell and Brantley 1992). The large multinational
firms, with well-established internal career ladders, lacked the cutting
edge of university science. Lacking a knowledge base in the new field of
molecular biology, the large companies were drawn to the start-ups,
which had more capability at basic and translational science
(Gambardella 1995; Galambos and Sturchio 1996). This asymmetric
distribution of technological, organizational, and financial resources
was a key factor in driving early collaborative arrangements in the
industry (Orsenigo 1989; McKelvey 1996; Hagedoorn and Royakkers 2002).

Many commentators at the time argued that these interdependent linkages
were fragile and fraught with possibilities of “hold-up,” in which one
party could opportunistically hinder the other's prospects for success.⁷
Some analysts argued that the field would undergo a shakeout. Thus large
pharmaceutical companies would assume dominance, while founding
rates for new firms would slow to a trickle (Sharpe 1991; Teece 1986).
But as these observers and others came to recognize, a shakeout did not
occur, nor did cherry-picking of the most promising firms by larger
companies prove viable.⁸ Instead, the ensuing period saw the give-and-take,
the mutual forbearance of relational contracting (Macneil 1978), become
institutionalized as common practice in this rapidly developing field. By
the late 1980s, some of the DBFs had become rather large and formidable
organizations in their own right, while many of the big pharmaceuticals
created in-house molecular biology research programs (Henderson and
Cockburn 1996; Zucker and Darby 1997). Even as mutual need declined


⁷ See Holmstrom and Roberts (1998) for a useful review of the hold-up problem and
a discussion of various alternatives to vertical integration as a solution.

⁸ Hybritech was one of the better-known DBFs of the mid-1980s. This San Diego-based
firm was purchased in 1986 by the large pharmaceutical company Eli Lilly for
$300 million. Within a year, no Hybritech employees remained with Lilly; meanwhile
more than 40 firms have been founded by former Hybritech employees (Walcott 2002).

as a basis for interorganizational affiliations, the pattern of dense
connectivity deepened, suggesting the original motivation of exchanging
complementary resources had changed to a broader focus on utilizing
innovation networks to explore new forms of R&D collaboration and product
development (Powell et al. 1996). This is the period we focus on, with
data from 1988 to 1999. Table 1, which lists the top-selling biotechnology
drugs in 2001, illustrates the division of innovative labor that has typified
the field. All 10 drugs were developed by biotech firms, but only five were
marketed by biotech firms, and just four by their originator. In the other
five cases, a large pharmaceutical company handled or guided the
marketing in return for a hefty share of the earnings. Comparable data from
the early 1990s indicate that marketing power and control of revenues
were dominated overwhelmingly by the pharmaceutical giants (Powell and
Brantley 1996). A notable feature of drug development is that there is no
consumer loyalty to a company and limited brand loyalty as well. Combine
these influences with a market structure that has many winner-take-most
features, and the outcome is a volatile, fast-changing field.

A number of factors undergird the collaborative division of labor in
the life sciences. No single organization has been able to internally master
and control all the competencies required to develop a new medicine. The
breakneck pace of technical advance has rendered it difficult for any
organization to stay abreast on so many fronts; thus linkages to
universities and research institutes at the forefront of basic science are
necessary (Orsenigo, Pammolli, and Riccaboni 2001). The high rate of technical
renewal is reflected in patent data. Figure 1 shows the brisk rise of life
science patenting and highlights the similarities in the technological
trajectories of universities and biotech firms.⁹ Note the parallel climb of
university and DBF patenting in the mid-1980s, a steady ascent for the
next 10 years, and a steep increase in the late 1990s. Universities start out
ahead and then are passed by DBFs in 1997, but the more important
point is the extent to which both become members of a common
technological community (Owen-Smith and Powell 2001a, 2001b). This
joint membership in a community greatly increases the frequency of
interaction between universities and industry.

The availability of funding has also increased markedly as biomedicine
has become a major force in modern society. The total budget of the US
National Institutes of Health (NIH), a key funder of basic research that
allocates approximately 80% of its budget to external research grants to
universities and firms, nearly doubled under the Clinton administration,


⁹ The data are drawn from the National Bureau of Economic Research patent database,
using our sample of biotech firms and Owen-Smith’s (2003) sample of the most
research-intensive US universities.

TABLE 1
TOP TEN BIOTECHNOLOGY DRUGS, 2001

Product             Indicated Use                           2001 Sales      Developer     Marketer
                                                            (in Millions)
Procrit             Red blood cell enhancement              3,430           Amgen         Johnson & Johnson
Epogen              Red blood cell enhancement              2,109           Amgen         Amgen
Intron A/Rebetron   Hepatitis C, certain forms of cancer    1,447           Biogen, ICN   Schering-Plough
Neupogen            Restoration of white blood cells        1,346           Amgen         Amgen
Humulin             Diabetes mellitus                       1,061           Genentech     Lilly
Avonex              Relapsing multiple sclerosis            972             Biogen        Biogen
Rituxan             B-cell non-Hodgkin’s lymphoma           819             IDEC          Genentech, IDEC
Enbrel              Rheumatoid arthritis                    762             Immunex       Immunex, American Home Products
Remicade            Rheumatoid arthritis, Crohn’s disease   721             MedImmune     Johnson & Johnson
Cerezyme            Enzyme replacement therapy              570             Genzyme       Genzyme

NOTE. Data are drawn from Standard & Poor's “Biotechnology,” May 2002.

FIG. 1. Life science patents assigned to biotechnology firms and research universities,
1976-99


going from $8.9 billion in 1992 to $17.08 billion in 2000. The NIH plays
a highly significant role in fostering exploration and variety on the research
front. Internal R&D expenditures by biotech and pharmaceutical
companies have also ramped up, from $6.54 billion in 1988 to $26.03 billion
in 2000.¹⁰ Venture capital disbursements, or seed money to biotech
start-ups, have flowed into biotech, but more irregularly, as the public equity
markets have windows of opportunity when particular technologies are
in vogue. As Lerner, Shane, and Tsai (2003) note, unexpected events
affecting a single firm, notably the rejection or delay of a drug candidate
by the US Food and Drug Administration, can have pronounced effects
on all firms’ stock prices as well as their ability to raise capital.
Consequently, venture funding of biotech is rather episodic, reaching $395.5
million in 1988, declining over the next three years, then jumping to $586.4
million in 1992, remaining around the half-billion level for the next four
years, then climbing to $1.1 billion in 1997, and staying above $1 billion
in 1998 and 1999 (National Science Foundation 2002). Biotech financing
by venture capital is also somewhat countercyclical; when there was great
enthusiasm for Internet and telecom start-ups, interest in biotech waned.
In recent years, with the burst of the Internet bubble and a precipitous
decline in telecommunications, funding for biomedical ventures has been
on the upswing. Biotech firms that are well positioned in the network


¹⁰ Sources: NIH budget, http://www.nih.gov, 2001; pharmaceutical R&D spending,
Pharmaceutical Manufacturers of America annual surveys, http://www.pharma.org
(downloaded November 2002).

with connections to basic research funding, industrial R&D support, and
venture capital financing are not only able to obtain money from multiple
sources, they also develop the capability to interact with varied
participants. These experiences facilitate organizational learning and expand the
scope and depth of an organization’s knowledge base.

The different members of the field have varying catalytic abilities and
competencies. Some of the participants are quite specialized, while others
have a hand in multiple activities. Figures 2a and 2b provide simple count
data to illustrate the relationship between functional activity and
organizational form and suggest how the correspondence of activity and form
has shifted over time. Figure 2a emphasizes an overall pattern of
expansion in number of ties, and for three of the forms of organization, a
branching out in terms of activity. Growth and diversity go hand-in-hand
for DBFs, public research organizations (including public and private
universities and nonprofit institutes and research hospitals; hereafter,
PROs), and large pharmaceutical companies. Venture capital growth is
notable as well, but as the percentages in figure 2b reveal, there is a strong
co-occurrence of some forms and activities: government agencies specialize
in R&D, venture capitalists (VCs) in finance. Some organizations,
however, are able to shift their attention. While enlarging the number of ties,
both dedicated biotech firms and public research organizations broadened
their range of activities as well. Figure 2b, which reports the percentage
of activity by organizational form, illustrates the pattern of specialization
by government and VCs, and the diversification by biotech and
pharmaceuticals, while PROs display a trend toward an R&D and licensing
model. Venture capitalists come to dominate finance, and pharmaceuticals
master commercial ties, while licensing and R&D are pursued by an array
of participants.

Finally, as the biotech field gained coherence, and the pattern of reliance
on collaboration proliferated, institutions emerged to both facilitate and
monitor the process. Offices were established on university campuses to
promote university technology transfer (Owen-Smith and Powell 2001b),
law firms developed expertise in intellectual property issues in the life
sciences, and venture capital firms provided financing along with
management oversight and referrals to a host of related businesses. As these
relations thickened and a relational contracting infrastructure grew
(Powell 1996), the reputation of a participant came to loom larger in shaping
others' perceptions. Robinson and Stuart (2002) argue persuasively that
the network structure of the field becomes a "platform for the diffusion
of information about the transactional integrity" of its participants.
Centrality in the network increases the visibility of a participant’s actions
and reduces the need for more overt, contractual forms of control, such
as an equity stake or dominance on the board of directors. We turn now
to a discussion of the database we have developed to map the evolving
structure of this field.

FIG. 2b. Distribution of organizational forms and activities, 1988, 1993, and 1997
(panel title: Percentage of Activity by Organizational Form)


DATA AND METHODS 


Our starting point in developing a sample is BioScan, an independent
industry directory, founded in 1988 and published six times a year, which
covers a wide range of organizations in the life sciences field.¹¹ Our focus
is on DBFs. We include 482 companies that are independently operated,
profit-seeking entities involved in human therapeutic and diagnostic
applications of biotechnology. We omit companies involved in veterinary or
agricultural biotech, which draw on different scientific capabilities and
operate in quite different regulatory climates. The sample of DBFs covers
both privately held and publicly traded firms. We include publicly held
firms that have minority or majority investments in them by other firms
as long as the company’s stock continues to be independently traded. We
exclude organizations that might otherwise qualify as DBFs but are
wholly owned subsidiaries of pharmaceutical or chemical corporations.
Large pharmaceutical corporations, health care companies, hospitals,
universities, or research institutes enter our database as partners that
collaborate with DBFs. Our rationale for excluding both small subsidiaries
and large, diversified chemical, medical, or pharmaceutical corporations
in the DBF database is that the former do not make decisions
autonomously, while biotechnology may represent only a minority of the activities
of the latter. Their exclusion from the primary sample of DBFs eliminates
serious data ambiguities.

The primary sample covers 482 DBFs over the 12-year period, 1988-99.
In 1988, there were 253 firms meeting our sample criteria. During the
next 12 years, 229 firms were founded and entered the database; 91 (of
the 482) exited, due to failure, departure from the industry, or
merger. The database, like the industry, is heavily centered in the United
States, although in recent years there has been considerable expansion in
Europe. BioScan reports information on a firm's ownership, formal
contractual linkages to collaborators, products, and current research. Firm
characteristics reported in BioScan include founding data, employment

¹¹ The first volume of BioScan was released in 1987 by the biotech firm Cetus, but
coverage was limited as many firms were reluctant to share data with a competitor.
Oryx Press issued the first independent directory in 1988. To supplement BioScan, we
consulted Recombinant Capital as well as various editions of Genetic
Engineering and Biotechnology Related Firms Worldwide, Dun and Bradstreet’s Who
Owns Whom?, and Standard & Poor's. In addition, we utilized annual reports, Securities
and Exchange Commission filings, and, when necessary, made phone calls to companies.

levels, financial history, and, for firms that exit, whether they were
acquired or failed. The data on interorganizational agreements cover the
time frame and purpose of the relationship. Our database draws on
BioScan's April issue, in which new information is added for each
calendar year. Hence the firm-level and network data are measured during
the first months of each year. We define a collaborative tie, or alliance,
as any contractual arrangement to exchange or pool resources between a
DBF and one or more partner organizations. We treat each agreement as
a tie and code each tie for its purpose (e.g., licensing, R&D, finance,
commercialization) and duration. Some ties involve multiple stages of the
production process. All such ties include commercialization activities, such
as manufacturing or marketing; hence we code complex agreements as
commercial ties. We say a connection, or link, exists whenever a DBF
and partner have one or more ties between them.
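
The distinction between a tie (one agreement) and a link (one or more ties between the same DBF and partner) can be made concrete with a minimal sketch. The record layout, field names, and organization labels below are hypothetical and are not drawn from the BioScan database; the rule for multi-stage agreements follows the coding convention described above.

```python
from collections import defaultdict
from dataclasses import dataclass


# Hypothetical record layout; the article does not publish its coding scheme.
@dataclass(frozen=True)
class Tie:
    dbf: str          # dedicated biotech firm
    partner: str      # partner organization
    purpose: str      # "licensing", "R&D", "finance", or "commercialization"
    start_year: int
    end_year: int     # last year in which the agreement was in force


def code_purpose(stages):
    """Collapse an agreement's stages into a single purpose.

    Following the text, agreements that span multiple stages are treated
    as commercial ties (assumed here to apply to any multi-stage agreement).
    """
    stages = set(stages)
    if len(stages) > 1:
        return "commercialization"
    return next(iter(stages))


def links_in_year(ties, year):
    """Return {(dbf, partner): number of ties} for agreements active in `year`.

    A link exists for every pair that appears in the result.
    """
    counts = defaultdict(int)
    for t in ties:
        if t.start_year <= year <= t.end_year:
            counts[(t.dbf, t.partner)] += 1
    return dict(counts)


ties = [
    Tie("DBF_A", "Pharma_X", code_purpose(["R&D", "commercialization"]), 1990, 1994),
    Tie("DBF_A", "Pharma_X", code_purpose(["licensing"]), 1992, 1993),
    Tie("DBF_A", "Univ_Y", code_purpose(["R&D"]), 1991, 1992),
]
links_1992 = links_in_year(ties, 1992)
print(links_1992)
```

In this toy example DBF_A holds three ties but only two links in 1992, because two of its agreements are with the same partner.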

We seek to explain the processes that attract two parties to one another,
the evolving patterns of tie formation and dissolution, and the overall
structure of the network. We code the dominant forms of partner
organizations into six categories, representing those that populate the field:
public research organizations; large multinational pharmaceutical
corporations (as well as chemical and diversified health care corporations);
government institutes (such as the National Cancer Institute or the Institut
Pasteur); financial institutions (principally venture capital as well as banks
and insurance companies); other biomedical companies (providers of
research tools or laboratory equipment); and those DBFs that collaborate
with other biotech companies. There are more than 2,300 non-DBFs in
the partner database.

The four types of ties involve different activities, ranging from basic
research to finance to licensing intellectual property to sales and
marketing. Thus, the matrix of organizational forms and activities is 6 x 4,
or 24 possible combinations of partner forms and functional activities of
ties. Some of the cells are quite rare, but there are cases in every cell.
Some of these activities involve the exchange or transfer of rights, while
others require sustained joint activity. The latter obviously entails more
integration of the two parties to a relationship. This difference is one
reason we treat the four types of activities separately in most analyses.

Given the differentiation of organizational forms and types of ties, our
approach has some limitations. In some, though not all, of our measures,
we treat the type of tie or the form of the partner as equally important.
Obviously, this is not altogether realistic; indeed, if the analyses were
based on only a single year of data, this limitation would loom large. The
benefit of this assumption is that it permits comparisons across time
periods. There is also heterogeneity in the nature of participation of different
organizational forms for a particular type of activity. At the extreme, an
R&D partnership between a global pharmaceutical company and a DBF
may reach the scale of $1 billion, while a DBF’s R&D alliance with a
university laboratory may involve as little as $100,000 or $200,000. The
salience of these limitations recedes as we add more years of observations
to the data set. The advantages of 12 years of fine-grained data reside in
capturing the length of relationships, the dissolution of ties to particular
partners and the forging of ties to others, as well as the deepening of some
ties. Issues of scale are assumed to be constant while we examine duration
of ties and the extent to which the parties involved in a relationship share
other partners in common at specific points in time. This approach allows
us to speak to Salancik’s (1995, p. 348) concern that network analysis
should show how adding or subtracting a particular interaction in a
network changes the coordination among the participants and either enables
or discourages interactions between parties.

We do not collect data on the ties among the non-DBF partner
organizations. In some cases, such ties would be very sparse or nonexistent
(e.g., venture capital funding of universities or pharmaceutical companies);
in other cases, they are more common (e.g., pharmaceutical support of
clinical trials conducted at a university medical center).¹² The practical
problem is that the data on a network of 2,310 x 2,310 disparate
organizations would be very difficult to collect. Thus, we analyze the
connections that DBFs have to partners, and the portfolio of DBFs with
whom each partner is affiliated. To do this we use k-components to identify
cohesive subsets of organizations.¹³ This measure of multiconnectivity


¹² Our collaborators Fabio Pammolli and Massimo Riccaboni at the University of
Florence have constructed a large database on R&D projects in the biomedical field
that covers more than 10,000 external collaborations among participants in the life
sciences throughout the decade of the 1990s. The most frequent type of partnership
(approximately 45%) was between a biotech firm and a pharmaceutical company. The
least common affiliation was between a pharmaceutical company and a public research
organization (0.5%).


¹³ We are indebted to James Moody, who applied to our network data his algorithm
for k-components (Moody and White 2003) as our measure of cohesion. A k-component
is a potent measure of cohesion because such a structure cannot be disconnected except
by removal of k or more nodes. In contrast, a maximal subgraph (known as a k-core)
in which all nodes have k degree or higher may be a disconnected graph (White and
Harary 2001). The computation of k-components as units of cohesion showed a perfect
(1.0) correlation for each time period between k-components and maximal k-core
subgraphs. This result points to a highly cohesive network, with a well-connected core
at the center. Save for a few small bicomponents on the periphery, there were singular
rather than multiple overlapping k-components at each level (e.g., at each level of
8, 7, 6, 5, 4, and 3), and each unique k-component was embedded in a k-component at
the next lower level of cohesion. Organizations in the eight-component for each time
period, for example, were embedded in the seven-component, and so on down the
stack. K-components are necessarily embedded, but they need not form a single
hierarchy. Distinct hierarchies of k-components may form with as many as k-1 overlap-

does not require complete data on relations among nonbiotech partners,
thus allowing us to analyze the total network of nearly 2,800 organizations.
Our focus, then, is on cohesion, mediated exclusively by ties with DBFs.
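
The contrast between a k-core (every node has at least k neighbors) and a k-component (no set of fewer than k nodes can disconnect it) can be illustrated with a small sketch. This is not the Moody and White algorithm; it is a brute-force check that is practical only for toy graphs, and the example graph (two four-node cliques sharing a single node) is invented for illustration.

```python
from itertools import combinations


def connected(adj, nodes):
    """True if the subgraph induced by `nodes` is connected (depth-first search)."""
    nodes = set(nodes)
    if not nodes:
        return True
    start = next(iter(nodes))
    seen, stack = {start}, [start]
    while stack:
        u = stack.pop()
        for v in adj[u]:
            if v in nodes and v not in seen:
                seen.add(v)
                stack.append(v)
    return seen == nodes


def k_core(adj, k):
    """Nodes that remain after repeatedly pruning nodes with fewer than k neighbors."""
    alive = set(adj)
    changed = True
    while changed:
        changed = False
        for u in list(alive):
            if sum(1 for v in adj[u] if v in alive) < k:
                alive.remove(u)
                changed = True
    return alive


def is_k_connected(adj, nodes, k):
    """Brute force: removing any set of fewer than k nodes leaves the subgraph connected."""
    nodes = set(nodes)
    if len(nodes) <= k:
        return False
    for r in range(k):
        for cut in combinations(sorted(nodes), r):
            if not connected(adj, nodes - set(cut)):
                return False
    return True


# Two 4-cliques that share the single node "s" (hypothetical toy graph).
edges = list(combinations(["s", "a", "b", "c"], 2)) + list(combinations(["s", "d", "e", "f"], 2))
adj = {}
for u, v in edges:
    adj.setdefault(u, set()).add(v)
    adj.setdefault(v, set()).add(u)

core3 = k_core(adj, 3)
print(sorted(core3))                                   # every node has degree >= 3
print(is_k_connected(adj, core3, 3))                   # removing "s" splits the graph
print(is_k_connected(adj, {"s", "a", "b", "c"}, 3))    # a single clique is 3-connected
```

Here the whole graph is a 3-core, yet it is not a 3-component, because removing the one shared node separates the two cliques. Each clique on its own does satisfy the stricter connectivity requirement.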

Network measures such as the analysis of degree distributions, unlike
k-components, require complete data. To this end, in the first and third
of the analyses that follow, we separate our database into two parts: the
complete one-mode network (482 x 482) of ties among DBFs, where we
also have extensive data on the attributes of the 482 biotech firms, and
the two-mode network that consists of complete data on the ties of DBFs
to non-DBF organizations.

Over the past decade, we have also interviewed more than 200 scientists
and managers in biotech and pharmaceutical firms, as well as university
professors who are actively involved in commercializing basic research.
Members of our research group have done participant observations in
university technology licensing offices, biotech firms, and large
pharmaceutical companies. Students working on the project have developed a
large data set on the founders of biotech companies and analyzed the
careers of scientists who joined biotech companies. In short, while the
analyses presented here are based on data derived from industry sources,
our intuitions about the questions to address are grounded in primary data
collection.


Analysis 1: Degree Distributions


Research on networks in the graph theoretic and statistical mechanics
traditions often utilizes degree distributions as a diagnostic indicator of
whether tie formation in a network (growth or replacement) is
equiprobable (simple random) for all pairs of nodes or biased proportional to
existing ties of potential partners. The degree of each node is measured
as the number of other nodes directly connected to the focal node.
Preferential attachment to already connected nodes is referred to as a
popularity bias. Unlike the tail of a random bell curve whose distribution
thins out exponentially as it decays, a distribution generated by a
popularity bias has a “fat” tail for the relatively greater number of nodes that
are highly connected. The fat tail contains the hubs of the network with
unusually high connectivity.
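
The difference between equiprobable attachment and a popularity bias can be seen in a small simulation. The sketch below is an illustrative toy model, not the analysis reported in this article: each newcomer forms a single tie, chosen either uniformly at random or in proportion to existing degree. The function name, network size, and seed are arbitrary assumptions.

```python
import random


def grow_network(n_nodes, preferential, seed=0):
    """Grow a network one node at a time; each newcomer forms one tie.

    preferential=True : partner drawn with probability proportional to degree.
    preferential=False: partner drawn uniformly among existing nodes.
    Returns the list of node degrees.
    """
    rng = random.Random(seed)
    degree = [1, 1]        # start from a single tie between nodes 0 and 1
    endpoints = [0, 1]     # each node is listed once per tie it holds
    for new in range(2, n_nodes):
        if preferential:
            partner = rng.choice(endpoints)    # degree-proportional draw
        else:
            partner = rng.randrange(new)       # equiprobable draw
        degree.append(1)
        degree[partner] += 1
        endpoints.extend([new, partner])
    return degree


random_deg = grow_network(5000, preferential=False)
popular_deg = grow_network(5000, preferential=True)
print("max degree, equiprobable attachment:", max(random_deg))
print("max degree, popularity bias:        ", max(popular_deg))
```

Under the popularity bias the best-connected node ends up with far more ties than under equiprobable attachment, which is the hub-laden fat tail described above.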

Different types of degree distributions can be distinguished when
plotted on a log-log scale, with log of degree on the x-axis and log of the
number of nodes with this degree on the y-axis. The degree distribution
for a network in which the formation of edges is governed by a popularity


ping nodes in common. Such was not the case for the cohesive biotech network (White
et al. 2004).

bias, where nodes with more connections have a higher probability of
receiving new attachments, would plot as a straight line on the log-log
graph, indicative of a power law.¹⁴ A power-law degree distribution is not
sufficient by itself, however, to identify the actual mechanisms that
facilitate tie formation. A power-law degree distribution can reflect not only
preferential attachment by incumbency (degree of attracting node) but
also preferences for attractiveness, legitimacy, diversity, or a concatenation
of mechanisms. In our study, we test for the existence of a preferential
attachment process from each year to the next rather than assume its
existence. We use multivariate analysis to try to discern specific mechanisms
that govern tie formation and the type of degree distributions that different
substantive processes generate. Before we begin that analysis, however,
it is useful to examine the degree distributions.

In figure 3, we plot the aggregate degree distributions of DBFs on
log-log scales for the six types of partner organizations: other DBFs;
pharmaceutical, chemical, and diversified health-care corporations;
universities, nonprofit research institutes, and hospitals; government agencies;
venture capital firms and other financial entities such as banks; and
biomedical companies that supply research tools and instruments. For each
of the six plots, the x-axis reflects log-degree (aggregated over all time
periods) and the y-axis the log of the number of partners of the plotted
form having a given network degree (also aggregated over all time
periods). A power-law distribution, as noted, would plot as a straight line.
For a network of sufficient density, an exponential-decay degree
distribution, mimicking the results of a simple random process of tie formation,
would form a convex curve that bows to the upper right away from the
origin in the log-log plot. An exponential distribution can be statistically
rejected based on the data presented in figure 3, along with a
simple-random attachment process that would generate degree distributions that
decay exponentially.¹⁵

The least squares fitted linear slopes for the log-log plot of the degree
distributions in figure 3 are in the expected range for power laws, between

¹⁴ A power law is a mathematical expression for a distribution that is unlike a normal
bell curve with a peak in the middle, where most nodes have a similar number of ties.
A scatterplot or histogram plotting power-law decay is a continually decreasing curve
where values on the Y axis approximate a linear function of those on the X axis raised
to a fixed power a; hence log y = C + a log x, where C is the intercept constant.


¹⁵ The exponent in this function is not fixed, but varies with values on the X axis.
Hence, the exponential y = b + C^x is linear in log y and x, but is bowed outward in
a log-log plot. Inequality resulting from a simple random process of tie formation is
accidental, or, in the case of network growth, it may be due to early entry. Networks
with exponential degree distributions do not have hubs that are extreme outliers in
terms of very high degree.

FIG. 3. Degree distributions by type of partner, all years


1.1 and 2.7, over four orders of magnitude of variation in degree. Only
the left half or low-degree half of one of the distributions, that for
government agencies, has a slope anywhere near three, as expected from pure
preferential attachment for large networks (Barabási and Albert 1999;
Bollobás and Riordan 2002). This government degree distribution is
interesting because of the highly pivotal central role of the NIH, which is
a key funder of basic research. The NIH is the most active partner in
the entire network. That the slopes of the five other distributions are
considerably less than three may be an indication that processes other
than pure preferential attachment are operative.¹⁶
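
A least squares slope on a log-log plot, of the kind reported above, can be computed as in the following minimal sketch. The counts are synthetic and constructed to follow an exact power law; they are not the biotech data, and the function name is an assumption. The fitted slope is negative, so an exponent of 2 appears as a slope of -2.

```python
import math


def loglog_slope(degree_counts):
    """Ordinary least squares fit of log(count) on log(degree).

    degree_counts maps a degree k (>= 1) to the number of nodes with that degree.
    Returns (slope, intercept) for log y = C + a * log x.
    """
    pts = [(math.log(k), math.log(c))
           for k, c in degree_counts.items() if k > 0 and c > 0]
    n = len(pts)
    mean_x = sum(x for x, _ in pts) / n
    mean_y = sum(y for _, y in pts) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in pts)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in pts)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


# Synthetic counts that follow y = 1000 * k^-2 exactly (illustrative only).
synthetic = {k: 1000.0 * k ** -2 for k in range(1, 51)}
slope, intercept = loglog_slope(synthetic)
print(round(slope, 3), round(math.exp(intercept), 1))
```

With real degree counts the points scatter around the line, and a simple fit of this kind is sensitive to the sparse high-degree tail, which is one reason a straight-looking plot is suggestive rather than conclusive.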


¹⁶ Whether there are attachment processes in the extremes of several of the graphs that
go beyond power-law attachment biases cannot be determined due to the very small

Although these degree distributions are aggregate measures over all
time periods, they give some hint of growth processes in attachment. The
shape of the distributions mimics what would be expected for tie formation
in the biotech network from a process of preferential attachment to degree,
but power-law slopes that are closer to one than to three suggest that
other substantive processes are operating. These static snapshots of the
degree distributions, however, do indicate that the importance of different
organizational forms varies with respect to patterns of affiliation. To
explore the dynamics of the field and the changing impact of different
organizations, we next present a series of network visualizations, followed
by a statistical examination of the actual attachment processes.


Analysis 2: Discrete-Time Network Visualizations


We utilize Pajek, a freeware package for the analysis and visualization
of networks, to present a series of discrete-time images of the evolution
of the biotechnology network. Pajek employs two powerful minimum-energy
or “spring-embedded” network drawing algorithms to represent
network data in two-dimensional Euclidean space. These algorithms
simulate the network of collaborations as a system of interacting particles,
in which organizational nodes repel one another unless network ties act
as springs to draw connected nodes closer together. Spring-embedded
algorithms iteratively locate a representation of the network that
minimizes the overall energy of the system, by reducing the distance between
connected nodes and maximizing the distance between unconnected
nodes. (For more on Pajek visualizations, see app. A.)
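
The logic of a spring-embedded layout can be sketched in a few lines. The code below is a simplified force-directed routine in the spirit of such algorithms, not Pajek's own implementation; the force formulas, step schedule, and toy graph are assumptions chosen for brevity.

```python
import math
import random


def spring_layout(nodes, edges, iterations=300, seed=1):
    """Simplified force-directed layout: all pairs repel, ties pull like springs."""
    rng = random.Random(seed)
    pos = {n: [rng.uniform(-1, 1), rng.uniform(-1, 1)] for n in nodes}
    k = 1.0 / math.sqrt(len(nodes))    # preferred distance between tied nodes
    step = 0.1                         # maximum movement per iteration
    for _ in range(iterations):
        disp = {n: [0.0, 0.0] for n in nodes}
        # Repulsion between every pair of nodes.
        for i, u in enumerate(nodes):
            for v in nodes[i + 1:]:
                dx, dy = pos[u][0] - pos[v][0], pos[u][1] - pos[v][1]
                dist = max(math.hypot(dx, dy), 1e-9)
                force = k * k / dist
                disp[u][0] += force * dx / dist
                disp[u][1] += force * dy / dist
                disp[v][0] -= force * dx / dist
                disp[v][1] -= force * dy / dist
        # Attraction along each tie.
        for u, v in edges:
            dx, dy = pos[u][0] - pos[v][0], pos[u][1] - pos[v][1]
            dist = max(math.hypot(dx, dy), 1e-9)
            force = dist * dist / k
            disp[u][0] -= force * dx / dist
            disp[u][1] -= force * dy / dist
            disp[v][0] += force * dx / dist
            disp[v][1] += force * dy / dist
        # Move each node, capping the displacement at the current step size.
        for n in nodes:
            length = math.hypot(disp[n][0], disp[n][1])
            if length > 0:
                scale = min(length, step) / length
                pos[n][0] += disp[n][0] * scale
                pos[n][1] += disp[n][1] * scale
        step *= 0.99                   # cool the system gradually
    return pos


# Toy graph: two triangles joined by a single bridging tie.
nodes = ["a", "b", "c", "d", "e", "f"]
edges = [("a", "b"), ("b", "c"), ("a", "c"),
         ("d", "e"), ("e", "f"), ("d", "f"), ("c", "d")]
pos = spring_layout(nodes, edges)


def mean_distance(pairs):
    return sum(math.dist(pos[u], pos[v]) for u, v in pairs) / len(pairs)


tied = {frozenset(e) for e in edges}
untied = [(u, v) for i, u in enumerate(nodes) for v in nodes[i + 1:]
          if frozenset((u, v)) not in tied]
print(mean_distance(edges), mean_distance(untied))
```

In the resulting layout, tied nodes sit closer together on average than untied ones, which is the property the visualizations rely on when clusters and stars are read off the images.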

We generate two sets of images for the time period covered by our
database. To simplify the presentations, we include only those
organizations in the main component each year, thereby removing the isolates
from this large and expanding network.¹⁷ The first set of images presents
the full collaboration network, while the second set represents the new
ties added each year. (Given space constraints here, we select a subset of


numbers of organizations with high degree. A degree distribution that plots as a concave
curve, bent in toward the origin on a log-log scale, might indicate a super-power-law
process in which more complex rules of attachment are operating and power-law
inequality is accelerated. With one small exception, hardly systematic, there is no
evidence in fig. 3 for a super-power-law process.


¹⁷ The main component is the largest connected cluster in the network. It eliminates
both isolates and small disconnected clusters. Most network measures are based on
the main component, which is a connected graph for which measures can be generated.
In substantive terms, the main component is the largest subset of organizations that
can reach each other through direct and indirect paths. The percentage of organizations
connected to the main component is 85.4% in 1988, dips slightly to 80.3% in 1992,
and rises to 98.6% by 1999.

these pictures for inclusion in the article, but we urge readers to view the
complete set in the online version of AJS.) The visualizations afford a
multifaceted view of network evolution, including growth in the number
of participants, changes in the purpose of ties, and the formation of new
ties.
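
The main component on which these images are based can be extracted with a breadth-first search. The sketch below uses an invented eight-node network with hypothetical labels, and the resulting percentage is for this toy example only.

```python
from collections import deque


def main_component(adj):
    """Return the largest set of nodes that can all reach one another."""
    seen, best = set(), set()
    for start in adj:
        if start in seen:
            continue
        comp, queue = {start}, deque([start])
        while queue:
            u = queue.popleft()
            for v in adj[u]:
                if v not in comp:
                    comp.add(v)
                    queue.append(v)
        seen |= comp
        if len(comp) > len(best):
            best = comp
    return best


# Toy network (hypothetical labels): one five-node cluster, one dyad, one isolate.
adj = {
    "DBF1": {"PharmaA", "Univ1"},
    "PharmaA": {"DBF1", "DBF2"},
    "Univ1": {"DBF1"},
    "DBF2": {"PharmaA", "VC1"},
    "VC1": {"DBF2"},
    "DBF3": {"Univ2"},
    "Univ2": {"DBF3"},
    "DBF4": set(),
}
core = main_component(adj)
share = 100.0 * len(core) / len(adj)
print(sorted(core), f"{share:.1f}%")
```

Dropping everything outside the returned set removes both the isolate and the disconnected dyad, mirroring the filtering applied before the network images are drawn.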

The Pajek figures are designed to reflect visually our hypothesized
models of attachment. The graphics cannot adjudicate between the various
models, which we do in statistical analyses below, but they do provide
suggestive evidence, or existence proofs, if you will. Consider what the
full network and new tie visualizations might look like if strong versions
of our four mechanisms were operating. After a sketch of these stylized
images, we then turn to the data and examine a series of empirical
visualizations. If an accumulative advantage process drives attachment, the
spring-embedded images would display a preponderance of star-shaped
structures attached to large nodes. These stars would be positioned at the
center of the image and continue to grow rapidly over time as newcomers
(triangles) affiliate with the most-connected incumbent nodes. We scale
the size of nodes to represent the number of ties an organization has.
Under a rich-get-richer model, we would expect few small nodes to have
numerous new partners, and many large nodes would be situated at the
network’s center. In contrast, if homophily strongly conditions tie
formation, then we would anticipate images differentiated into coherent and
loosely interconnected single-color clusters. These homophilous clusters
should be fairly dense and dominated by characteristic organizational
forms without necessitating a preponderance of stars tethered to large
central nodes. New entrants (pictured as triangles) would move into the
neighborhoods that most closely matched their type and profile.

Were a follow-the-trend logic dominant, new ties would be overwhelmingly
uniform in color and the predominant color should reproduce the previous
year’s pattern of affiliation. If, for instance, R&D (red) ties dominated
in a prior year, neophytes would generally enter the network through this
established route and the images would be fields of red. A preference for
diversity of partners and pathways implies less visual coherence than the
other mechanisms. Multiconnectivity coupled with heterogeneous activities
would show clusters in which all four colors would be evident in the
springs. The nodes at the center of the network should be different
colors, reflecting the array of organizational types. Some small nodes
should be noticeable at the center of the field, while some large nodes
should have few new partners. New entrants should be sprinkled throughout
the network. Empirical reality is, of course, more messy than stylized
models. We use these thought experiments to make the detailed
visualizations of this large database more interpretable to readers not
familiar with graphical representations of network dynamics.

Figure 4, showing the main component in 1988, serves as the starting
point. The color of the nodes reflects their organizational form, with
light blue a DBF, yellow a large pharmaceutical, chemical, or diversified
medical corporation, and brown a government institute or agency. In
subsequent years gray nodes become important, reflecting the growing
imprint of venture capital. The springs are colored according to the
functional activity reflected by a tie, with red an R&D partnership,
magenta a licensing agreement, green a financial relationship, and dark
blue indicating an alliance involving one or more stages in the
commercialization process, ranging from clinical trials to manufacturing
to sales and marketing. Node size is scaled to standardized network degree
in the total network, reflecting variation in the extent of degree
connectivity among the organizations. To return to our earlier metaphor of
a dance, the representation captures dancers with different identities who
may participate in different types of dances with one or many partners.
The size of a node, given the power-law tendencies in the degree
distributions, might reflect a rich-get-richer process. In the images that
follow, particularly the visualizations of new ties, there are several
large blue DBF nodes that are clearly the most attractive stars of the
network. In later years, there are nodes that have multiple linkages for
different activities, reflecting a preference for diversity. Looking at
the overall population rather than specific nodes, we observe shifts in
the dominant activities as well as changes in the composition of the
nodes, which illustrate the overall trends in the field.

Several key features stand out for 1988 in figure 4. The predominant
color is blue, and the most active participants are biotech firms,
pharmaceutical companies, and several government agencies. The strong
impact of commercialization ties is a clear indication of the dominant
strategy of mutual need that characterized the industry’s early years.
Biotech firms lacked the capability to bring novel medicines to market,
while large firms trailed behind in understanding new developments in
molecular biology (Gambardella 1995; Powell and Brantley 1996; Henderson,
Orsenigo, and Pisano 1999). Finance ties (green) are less prevalent and
only a few venture capital firms (gray) are present, providing further
evidence that most DBFs supported themselves by selling their lead product
to large corporations, which subsequently marketed the medicine and
pocketed the lion’s share of the revenues. Clustered in the center are red
(R&D) and magenta (licensing) ties, which show that DBFs with significant
intellectual property and strong research capability are highly sought for
collaboration. The large multiconnected nodes in the center of the
representation are a small group of established, first-generation DBFs,
major multinational firms, and government institutes (the NIH and the
National Cancer Institute). In the detail shown in the lower right-hand
corner of figure 4, we identify a handful of the largest nodes: NIH, which
will serve as an orienting node in the full network figures because of its
centrality, and NCI; a group of first-wave biotechs founded in the 1970s
and early 1980s (Genentech, Centocor, Amgen, Genzyme, Biogen, and Chiron,
which became the largest and most visible firms by 1988); and three
multinational firms (Eastman Kodak, Johnson & Johnson, and Hoffman La
Roche). The close proximity of the Swiss firm Hoffman La Roche to
Genentech is interesting because in 1990 the Swiss firm became the
majority stockholder of Genentech. Kodak and Johnson & Johnson reflect the
broad interest in biotech by a range of large firms in different
technology sectors. Kodak soon drops out of the center of the field, as
does Johnson & Johnson, but the latter returns in the late 1990s by
acquiring two companies, first Centocor and then Alza, both established,
well-connected DBFs. Kodak’s subsequent loss of centrality and Johnson &
Johnson’s purchase of two incumbent DBFs illustrate that a first-mover,
rich-get-richer account does not always hold, even for some of the early
resource-rich participants.

In figure 5, we present the new ties added in 1989. To return to the
dance metaphor, the music has stopped temporarily while new partners are
being chosen. We add shape to the presentation, with triangles
representing new entrants to the network, while circles are the
incumbents. Note the very active role of NIH (the largest brown node) in
forging R&D ties with new entrants, and the appearance of many gray
triangles, illustrating the growing involvement of venture capital in
financing DBFs. Node size continues to be scaled for network degree in the
prior year’s full network, thus graphically representing how network
position in one year may condition the addition of new ties in a
subsequent year. The distribution of node sizes might indicate the
importance of rich-get-richer models of attachment. In the initial years,
we see visual affirmation of the positive effect of number of previous
ties on new tie formation. Note that several large blue DBF nodes are at
the center of the new tie network. The dominance of commercialization
springs (blue) in 1988 recedes markedly in the 1989 new ties picture. R&D
and finance are the main avenues generating growth in the network. Observe
that large nodes with just a few ties have little diversity in their
partners, while some of the smaller nodes adding many new ties have a wide
variety of partners.

We fast forward to 1993, but encourage readers to follow the annual
changes in the biotech field represented by the visualizations available
on the AJS Web site.¹⁸ Figure 6 portrays a large expansion in activity,
with green ties (finance) now much more prominent. We also observe a shift
in the composition of the most connected participants. Put colloquially,
the music has changed, from commercialization to finance, and accompanying
the shift in the predominant collaborative activity is both an increase in
the number of highly connected nodes (there are roughly three times as
many larger nodes as in 1988) and the march-in of gray (venture capital)
and orange (university) nodes. This shift in the primary locus of activity
is important on a number of dimensions. Finance, as opposed to
commercialization, has a powerful mobilizing effect, enrolling new types
of actors (VC, and subsequently investment banks, pension funds,
university endowments, etc.) in financing the expansion of the field. As
we have shown elsewhere, the locus of venture capital-financed biotech
start-ups was initially the Bay Area and Boston, but by the end of the
1990s had spread to a number of key regions in the United States and
Europe (Powell, Koput, Bowie, and Smith-Doerr 2002; Owen-Smith et al.
2002; Sorenson and Stuart 2001). Thus growth in the number of new firms,
new partners, and new ideas is enhanced by an increase in financial
linkages, signaling the important role of the public equity markets in
fostering growth. In contrast, commercialization is a more restrictive
activity. The ability to manufacture new biomedical products was a
relatively scarce skill, as was the ability to market and distribute a new
medicine throughout the world. A relatively small number of large firms
had these capabilities, and it took at least a decade before DBFs
developed these skills. Consequently, during the first two decades of the
field, commercialization ties flowed through a small set of dominant
multinationals and a handful of established biotech firms. Thus, not only
was the number of participants limited, but commercialization is also a
“downstream” activity; indeed, when it involves the sale of a new medical
product, it is the “last dance” in the product life cycle. Finance, in
contrast, is an “upstream” activity, which, in turn, fuels R&D, licensing,
and commercialization, and thus enrolls more participants in the industry
network.

18 We use annual changes as a matter of convention, as publicly traded
firms routinely provide accounts of their activities on a yearly basis and
various data sources are organized in this manner. We have spent a good
deal of energy studying the sequences of partners and activities, and have
analyzed the time in monthly intervals to key events, e.g., first tie,
first R&D collaboration, going public, etc. Because the data are more
reliably reported annually, we use year-to-year changes in the
visualizations.

FIG. 5. 1989 main component, new ties

The organizational composition of the center of the field has shifted as
well. Two research universities, MIT and Harvard, along with a handful of
leading VC firms, are now at the center. The composition of multinational
firms shifts from diversified chemical and medical companies to some of
the giants of the pharmaceutical sector, for example, Schering-Plough and
Merck. The diversity of organizational forms at the center of the network
is notable in that these varied organizations (DBFs, pharmaceuticals, VCs,
research universities, and government institutes) operate in distinct
selection environments, subject to very different pressures and
opportunities. Networks anchored by diverse organizational forms are more
robust to both failure and attack than homophilous structures (Albert,
Jeong, and Barabási 2000; Owen-Smith and Powell 2004). Such diversely
anchored, multiconnected networks are less likely to unravel than networks
reliant on a single form of organization for their cohesiveness.

The picture of new ties in 1994 (fig. 7) reflects growing complexity in
the activity sets of the participants. On the right side of the network,
finance is very pronounced, and there are many more gray nodes, which are
growing in size. But note there are now yellow nodes (which are not
triangles, so these are not new entrant pharmaceutical companies) involved
in financing smaller biotech firms. On the left side of the figure, we see
blue and green ties linked to well-connected DBFs. At the center of the
network is the NIH, the anchor of R&D activity and the largest node,
linked to both small and large DBF nodes. The picture of new ties in 1994
illustrates the growing multivocality of the industry, with both
well-connected DBFs and pharmaceuticals developing the capability to
finance younger firms, contribute to basic and clinical science, and
commercialize new medicines. The overall picture has shifted from one in
which commercialization and rich-get-richer were the dominant scripts to
one in which finance is generating much more diversity in activity and
there is more heterogeneity in the makeup of the key participants. With
greater numbers of new ties added in 1994, the more treelike structure of
springs in 1989 has now changed to reflect multiconnectivity, that is, a
more cohesive structure even among the new partnerships.

The density of the field and the number of participants continue to grow
throughout the 1990s. The picture of all ties in the main component in
1997 (fig. 8) illustrates a growing number of large nodes, strong
expansion of collaborative activity and a reaching out to new entrants,
and more varied types of organizations at the center of the figures. In
1997 (fig. 8), we see a very cohesive hub of DBFs, pharmaceuticals,
venture capital firms, universities, and the NIH complex. The colors of
the ties are less discernible, reflecting the fact that the most active
members of the network are now either engaged in multiple activities or
connected to DBFs that are. The pharmaceutical sector underwent a period
of consolidation in the mid to late 1990s, as mergers and acquisitions
became commonplace. Novartis, formed out of the merger of the two large
Swiss firms Sandoz and Ciba Geigy, and Glaxo Wellcome, the product of the
acquisition of Burroughs Wellcome by fellow British giant Glaxo, are in
the center of the network. Outside the center there are several specialist
DBFs, one with a very active commercialization portfolio and another with
a licensing cluster. In the new ties image for 1998 (fig. 9), finance
continues to be generative in pulling in new entrants. For the first time,
a handful of pharmaceutical giants, bolstered by a round of mergers and
acquisitions, are central in the new tie network. While the NIH remains in
the core, it is no longer clearly the largest node. Many more DBFs are
found in the center and on the edges of the new tie network, reflecting
the growing presence of second-generation firms that are active in the
field. The consolidation in the pharmaceutical sector is producing a
rich-get-richer effect among the largest multinationals, but these
survivor corporations have learned to do more than just commercialize the
lead compounds of the smaller DBFs. The big multinationals have become
multivocal. Meanwhile, a combination of DBFs, universities, and government
institutes is active in pulling in new participants.

The full network image for 1997 (fig. 8) shows a notable clustering of
financial ties on the right side, with connections to smaller-size DBF
nodes. This shift underscores the cyclical nature of venture capital,
which involves taking a firm public, ending that relationship, and moving
on to finance new firms. The number of yellow nodes at the center has
decreased, as consolidation shrinks the number of big pharmaceutical
firms. In turn, the size of each yellow node increases, as their portfolio
of alliances grows and diversifies. Recall that the NIH’s budget for R&D
grew markedly throughout the 1990s, and it remains hugely important as a
funder of basic research and as a participant in licensing the results of
intramural NIH research. The new tie network in 1998 (fig. 9) is the most
expansive yet, with more than 1,100 new ties added. Multivocality is the
dominant pattern here, as nearly every large node is engaged in multiple
types of collaborations.

FIG. 8. 1997 main component, all ties

Pajek visualizations, which we use to show the extent of clustering as
well as type of organization, type of tie, and network degree, can be
considered to provide a visual goodness-of-fit test for the weighting of
our hypotheses. As a supplement to the graphics, we provide count data on
patterns of entry and exit into the network. We see in table 2 that the
number of participants, both DBFs and partners, grows every year. But the
rate of expansion for total number of ties and new ties outpaces the entry
of organizations, suggesting a more connected field. The visualizations
afford the opportunity to see the diverse types of organizations that are
driving this connectivity.¹⁹ The rate of tie dissolution grows, then
wanes, then heads up, so there is considerable turnover in
interorganizational relations, reflecting both successful completion of
some projects and dissolution due to lack of progress on others. The
general picture is one of a continuing flow of new entrants into the
field, alongside the forging of new collaborations, making for an
increasingly dense network. To test our hypotheses, as well as the
insights derived from the visualizations, we turn to a more fully
specified analysis in which we take organizational variables, network
portfolios, and time periods into account.

19 There is a falloff in new ties and partner entry in 1999, which
reflects incomplete reporting in 1999 annual reports. We have since added
four more years of data and observe a downturn in 1999-2001 and a notable
uptick in 2002 and 2003.
TABLE 2

                   DBFs                 NON-DBF PARTNERS
            In         New            In         New
YEAR        Network    Entrants       Network    Entrants

1988        155        ...            579        ...
1989        181        29             672        156
1990        199        28             747        146
1991        210        18             792        119
1992        233        31             800        123
1993        265        35             873        149
1994        297        35             938        119
1995        316        24             985        141
1996        340        34             1,058      165
1997        351        14             1,172      201
1998        360        13             1,313      251
1999        363        12             1,332      122
All years   450        273            2,265      1,692

NOTE. Thirty-two biotech firms never have any ties. We include these firms
in subsequent [...] we exclude 45 organizations from the non-DBF partner
list because we cannot determine their [...]
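
The DBF columns of table 2 can be cross-checked with simple arithmetic. The sketch below is only an illustration, not part of the original analysis: it assumes that each year's change in the number of DBFs in the network equals new entrants minus departures (net of any re-entry), and the function name `implied_net_exits` is hypothetical.

```python
# DBF counts transcribed from table 2 (1988-1999); None marks the year with
# no reported entrant count.
dbf_in_network = [155, 181, 199, 210, 233, 265, 297, 316, 340, 351, 360, 363]
dbf_new_entrants = [None, 29, 28, 18, 31, 35, 35, 24, 34, 14, 13, 12]


def implied_net_exits(in_network, new_entrants):
    """Entrants minus net growth for each year after the first.

    Assumption (not stated in the source): in_network[t] - in_network[t-1]
    equals new entrants minus exits, net of any re-entry.
    """
    exits = [None]  # nothing can be inferred for the first year
    for prev, cur, new in zip(in_network, in_network[1:], new_entrants[1:]):
        exits.append(new - (cur - prev))
    return exits


net_exits = implied_net_exits(dbf_in_network, dbf_new_entrants)
total_entrants = sum(n for n in dbf_new_entrants if n is not None)
```

Under that assumption, the yearly entrant counts sum to the "All years" figure reported in the table, and the implied number of departing DBFs is small but nonzero in every year.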
Analysis 3: Attachment Bias


We now turn to a statistical analysis of attachments between DBFs and
various partner organizations. For analytic purposes, we distinguish
between four categories of relationships:


1. New one-mode attachments, in which a DBF contracts with another DBF
as a partner and the tie is the first between this dyad.

2. Repeat one-mode attachments, in which a DBF contracts with a DBF
partner and the tie is not the first between this dyad.

3. New two-mode attachments, in which a DBF contracts with a non-DBF
partner and the tie is the first between the dyad.

4. Repeat two-mode attachments, in which a DBF contracts with a non-DBF
partner and the tie is not the first between the dyad.


New attachments expand the structure of the network, whereas repeat ties
thicken relations between existing dyads. One-mode attachments create a
cooperative structure among competing biotech firms, while two-mode
attachments engage different organizational forms to access resources and
skills. Each relationship involves a focal DBF, a partner to which the
attachment occurs, and a set of alternative partners with whom the DBF
might have collaborated but did not. We refer to the set of partners to
which the focal DBF might link for a particular observed attachment,
including the partner to which the connection occurred, as the risk set
for that attachment. For new one-mode attachments, the risk set is other
DBFs, excluding those to which the focal DBF is currently or has
previously attached. For repeat one-mode attachments, the risk set is,
conversely, those current or prior partners of the attaching DBF. For new
two-mode attachments, the risk set is partners other than DBFs, excluding
those to which the attaching DBF is currently or has previously been
linked. For repeat two-mode attachments, the risk set is, again
conversely, all current or prior partners (other than DBFs) of the focal
DBF.
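
The four risk-set definitions can be sketched in a few lines of code. The following is a minimal illustration under stated assumptions, not the authors' implementation: the function `risk_set`, its arguments, and the organization labels are all hypothetical.

```python
def risk_set(focal, candidates, partners_so_far, repeat):
    """Return the set of alternative partners for one attachment by `focal`.

    candidates      -- organizations of the relevant mode: other DBFs for
                       one-mode ties, non-DBF organizations for two-mode ties
    partners_so_far -- current or prior partners of the focal DBF
    repeat          -- False for a new attachment, True for a repeat one
    """
    pool = set(candidates) - {focal}      # a DBF cannot attach to itself
    known = pool & set(partners_so_far)   # current or prior partners, this mode
    return known if repeat else pool - known


# Hypothetical labels, used only for illustration.
dbfs = {"dbf_a", "dbf_b", "dbf_c", "dbf_d"}
non_dbfs = {"university_u", "pharma_p", "agency_g"}
history = {"dbf_b", "agency_g"}           # dbf_a's current or prior partners

new_one_mode = risk_set("dbf_a", dbfs, history, repeat=False)
repeat_one_mode = risk_set("dbf_a", dbfs, history, repeat=True)
new_two_mode = risk_set("dbf_a", non_dbfs, history, repeat=False)
repeat_two_mode = risk_set("dbf_a", non_dbfs, history, repeat=True)
```

Because each attachment changes the focal DBF's partner history, the risk sets would have to be recomputed after every observed tie.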


MEASURES 


We draw upon multiple measures to test each hypothesized mechanism
across four classes of ties. We present and define the variables used in
our statistical analyses, along with appropriate controls, in table 3. Our
dependent variables are binary indicators of whether an alliance of each
of the four types occurs between a DBF and a partner, given that the DBF
and partner are “at risk” of attaching. A DBF may forge connections to
more than one partner in any given window of time, as long as alliances
can be ordered such that a partner for which a linkage occurs at a
specific moment is removed from (added to) the risk set for subsequent
TABLE 3
VARIABLES IN STATISTICAL TABLES

  Label                                     Unit of Observation     Description

Dependent variables
  New attachment                            Tie                     Computed as a binary indicator of whether an attachment occurs between a DBF and a partner for the first time
  Repeat attachment                         Tie                     Computed as a binary indicator of whether an attachment occurs between a DBF and a partner, other than for the first time

Independent variables

Accumulative advantage
  Firm degree                               DBF                     No. of ties of the DBF just prior to the attachment
  Partner degree                            Partner                 No. of ties of the partner just prior to the attachment
  Firm experience                           DBF                     No. of years since inception of DBF’s first tie
  New partner                               Partner                 Indicates whether this is the partner’s first year in the network
  Partner experience                        Partner                 No. of years since inception of partner’s first tie
  Prior ties                                Dyad                    No. of prior ties connecting the DBF-partner dyad
  Prior experience                          Dyad                    Duration since first tie connecting the DBF-partner dyad

Homophily
  Collaborative distance                    Dyad                    One-mode Euclidean distance between the activity-type-by-partner-form profiles of the attaching DBF and partner, just prior to the attachment
  Collaborative distance                    Dyad                    Two-mode Euclidean distance between the activity-type profiles of the attaching DBF and partner, just prior to the attachment
  Age difference                            Dyad                    One-mode absolute difference in age between the attaching DBF and the partner, at time of attachment
  Size difference                           Dyad                    One-mode absolute difference in number of employees between attaching DBF and the partner, at time of attachment
  Governance similarity                     Dyad                    One-mode dummy variable capturing whether both firms are publicly traded or privately held, at time of attachment
  Co-location                               Dyad                    One-mode indicator of whether the attaching DBF and partner are in the same region, at time of attachment
  Partner’s partner collaborative distance  Partner’s neighborhood  Average Euclidean distance between activity-type-by-partner-form profiles of attaching DBF and other DBFs attached to the partner
  Partner’s partner age difference          Partner’s neighborhood  Average absolute difference in age between the attaching DBF and other DBFs attached to the partner
  Partner’s partner size difference         Partner’s neighborhood  Average absolute difference in number of employees between the attaching DBF and other DBFs attached to the partner
  Partner’s partner governance similarity   Partner’s neighborhood  Average absolute difference in whether publicly held between the attaching DBF and other DBFs attached to the partner
  Partner’s partner co-location             Partner’s neighborhood  Average of indicator of whether the attaching DBF and other DBFs attached to the partner are in the same three-digit zip code region

Follow-the-trend
  Dominant trend                            Field                   % of other attachments up to the time of the attachment that are in same activity-type-by-partner-form category
  Dominant type                             Partner                 % of a partner’s ties that fall into the same activity-type category as the activity type of the attachment

Multiconnectivity
  Firm cohesion                             DBF                     DBF’s maximum value of k for which the DBF is in a k-component, just prior to the attachment
  Partner cohesion                          Partner                 Partner’s maximum value of k for which the partner is in a k-component, just prior to the attachment
  Shared cohesion                           Dyad                    Maximum value k of k-components occupied by both partner and DBF, just prior to attachment


TABLE 3 (Continued)

  Label                                       Unit of Observation     Description

  Firm tie diversity                          DBF                     Blau heterogen[...] portfolio, just [...]
  Partner tie diversity                       Partner                 One-mode Bla[...] partner-form [...]
  Partner tie diversity                       Partner                 Two-mode Bla[...] portfolio, jus[...]
  Prospective diversity                       Dyad                    Change in DB[...]
  Partner’s partner collaborative diversity   Partner’s neighborhood  Average tie div[...] of activity-ty[...]

Controls
  Period                                      Field                   Period = 1 19[...]
  Timeline                                    Field                   Linear time tre[...]
  Age                                         DBF                     Duration in yea[...]
  Size                                        DBF                     No. of employe[...]
  Governance                                  DBF                     Indicates whet[...]
  Location                                    DBF                     Three-digit zip [...]
  Form                                        Partner                 Two-mode ind[...] corp, univers[...] for-profit [...]
  Type                                        Dyad                    Indicates type [...] research, fina[...]
new (repeat) attachments. The predictor variables operationalize the
attachment mechanisms for each of the four hypotheses: accumulative
advantage, homophily, follow-the-trend, and multiconnectivity. For each
observed attachment, partner and dyad measures are computed for all
partners in the risk set.

Accumulative advantage is reflected in the network degree and experience
of both the attaching DBF and partner, which capture both the number of
ties and years of experience with collaborations. For repeat ties, we also
include prior ties and prior experience of the DBF-partner dyad. We also
use new partner, an indicator of whether a partner has been in the network
for less than a year, to capture the effect of being “a new kid on the
block.”

Homophily is assessed in several ways. We measure the collaborative
distance between the alliance profiles of the DBF and the partner in the
attaching dyad. For one-mode attachments, we measure the age difference,
size difference, and governance similarity between the two DBFs forming
the dyad, as well as measuring co-location. For two-mode attachments, we
lack data to measure partner’s age or size difference, and governance
similarity does not apply to noncommercial organizations. We did measure
homophily, however, between the attaching DBF and other DBFs in the
partner’s neighborhood, defined as the set of DBFs to which that partner
has direct ties, with the variables collaborative distance, age
difference, size difference, governance similarity, and co-location, all
considered with respect to a partner’s other affiliations. These “second
order” measures account for the possibility that connections to partner
organizations are conditioned by the prior experience of those partners
with other DBFs.
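
To make the collaborative distance measure concrete, the sketch below computes a Euclidean distance between two activity-type profiles. It is a hedged illustration only: we assume a profile is a vector of tie counts over the four activity types named in the text (whether the original analysis used counts or proportions is not stated here), and the function names are hypothetical.

```python
import math

# The four activity types distinguished in the visualizations.
ACTIVITY_TYPES = ("R&D", "finance", "commercialization", "licensing")


def activity_profile(tie_types):
    """Count an organization's ties by activity type (assumed profile form)."""
    return [sum(1 for t in tie_types if t == a) for a in ACTIVITY_TYPES]


def collaborative_distance(profile_a, profile_b):
    """Euclidean distance between two equal-length profiles."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(profile_a, profile_b)))


# Hypothetical alliance portfolios just prior to an attachment.
dbf_profile = activity_profile(["R&D", "R&D", "finance"])   # [2, 1, 0, 0]
partner_profile = activity_profile(["R&D", "licensing"])    # [1, 0, 0, 1]
distance = collaborative_distance(dbf_profile, partner_profile)
```

A distance of zero indicates identical profiles; larger values indicate more dissimilar alliance portfolios. The “second order” version would average this distance over the other DBFs in the partner's neighborhood.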

Follow-the-trend is captured by the field-level variable dominant trend
and the partner variable dominant type. Both measures reflect whether
firms are engaging in activities that are comparable to those of others in
the field. Multiconnectivity has two facets: cohesion and diversity. The
former captures the extent to which firms are connected by multiple
independent pathways, while the latter reflects whether firms are engaged
in multiple types of activities. Cohesion is calculated using the maximum
level of k-component for each firm and partner, measured separately as
firm cohesion and partner cohesion, and jointly as shared cohesion, which
is the maximum level for which both parties share a common k-component, if
any.²⁰ For diversity, we use Blau’s (1977) heterogeneity measure


20 More precisely (White and Harary 2001, pp. 12-14), “The (node-)
connectivity κ(G) is defined as the smallest number of nodes that when
removed from a graph G leave a disconnected subgraph or a single node. A
maximal connected subgraph of G with connectivity k > 0 is called a
k-component of G, with synonyms component for 1-component, bicomponent for
2-component, tricomponent for 3-component, etc. A cohesive block of a
graph G is a k-component of G where the associated value of
as an index of both the range of activities and multiple types of
partners. We measure diversity in four ways: for a DBF (firm tie
diversity), prospectively for a DBF (prospective tie diversity, i.e., the
diversity of a firm’s collaborative profile if the attachment were to
occur), for a partner (partner tie diversity), and for the average of a
partner’s set of partners (partner’s partner tie diversity). One might
think of the last measure as an assessment of the heterogeneity of a DBF’s
partner’s neighborhood.
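
Blau's heterogeneity index is one minus the sum of squared category shares. The sketch below illustrates it on a hypothetical portfolio of tie types; the function names, the treatment of an empty portfolio, and the example data are assumptions made for illustration rather than details taken from the analysis.

```python
from collections import Counter


def blau_index(categories):
    """Blau heterogeneity: 1 minus the sum of squared category shares."""
    n = len(categories)
    if n == 0:
        return 0.0  # assumption: an empty portfolio has no diversity
    counts = Counter(categories)
    return 1.0 - sum((c / n) ** 2 for c in counts.values())


def prospective_diversity(categories, candidate):
    """Diversity of the portfolio if one more tie of type `candidate` formed."""
    return blau_index(list(categories) + [candidate])


# Hypothetical portfolio: shares are 0.5, 0.25, and 0.25.
portfolio = ["R&D", "R&D", "finance", "licensing"]
current = blau_index(portfolio)
prospective = prospective_diversity(portfolio, "commercialization")
```

The index is zero when all ties fall in one category and rises toward its maximum as ties spread evenly over more categories, so adding a tie of a new type raises the prospective value above the current one.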

The control variables include firm demographics measured at the time of
attachment, including age (in years), size (number of employees),
governance (whether publicly held), and location, measured by three-digit
zip code region, or in the case of companies outside the United States, by
the telephone prefix for nation and city. The key partner control is the
variable form, which reflects the form of organization, and the key dyad
control is type, which reflects the type of activity that is the focus of
the collaboration. We also include three time period variables, which
emerged from the discrete-time images as key inflection points in the
pattern of affiliation, along with a linear time trend variable, time
line. The descriptive statistics for all the variables are presented in
appendix B, table B1.


STATISTICAL METHOD 


Our challenge is to model a set of binary indicator variables, each of
which can occur multiple times within a firm, and for each occurrence we
must update the risk set of alternative partners and the network measures.
Consequently, our unit of analysis must be the attachment rather than
either the individual firm or the dyad. Our choice of a statistical model
for analyzing attachment bias is based on our unit of analysis, as well as
empirical and theoretical considerations. Empirically, the design of our
sample defined the population of DBFs and then identified all the partners
engaged in alliances with them. Theoretically, our research question asks
what mechanisms account for differential (as opposed to simple random)
patterns of attachment. For these reasons, we use McFadden’s estimator for
multiprobability assessments, which is a variant of a conditional logit
model that takes each event as a unit of analysis, which in our case are
attachments, and distinguishes between a focal population and a set of
alternatives (McFadden 1973, 1981; also see Maddala 1986; Ben-Akiva


connectivity defines the cohesion of the block.” A graph G is k-connected
if κ(G) ≥ k, hence we use the term multiconnected. A fundamental theorem
of graphs is that a multiconnected k-component is also equivalent to a
maximal graph with k or more node-independent paths between every pair of
its nodes, which adds significantly to the power of the concept of
multiconnectivity as a measure of cohesion.

and Lerman 1989).²¹ We set up the data so that the focal DBFs are the
population and the partners in the risk set for an attachment are the
alternatives, thus reflecting the one- and two-mode networks in our
sample.

We conducted the analyses in three stages. First, we perform overall
tests of the four hypotheses, on each type of attachment, by applying
McFadden’s estimator as follows. For each attachment, the probability of
DBF i attaching to partner j, given that DBF i attaches at time t to some
partner in the set $J_{it}$, is specified as a function of partner (X) and
dyadic (W) variables:


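\[
\operatorname{prob}\Bigl(y_{ijt} = 1 \,\Big|\, \sum_{k \in J_{it}} y_{ikt} = 1,\; X_{jt},\; W_{ijt}\Bigr)
= \frac{\exp(\beta X_{jt} + \gamma W_{ijt})}
       {\sum_{k \in J_{it}} \exp(\beta X_{kt} + \gamma W_{ikt})}
\]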


Some of the partner characteristics are not defined for partners without
ties prior to the attachment. We include these characteristics by
interacting them with an indicator variable for whether the partner has
any prior ties, and include the main effect of this indicator variable in
the models. The main effect is always negative, showing that partners with
no ties in the prior year are more likely to receive attachments.
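
The conditional logit probabilities can be illustrated numerically. In the sketch below, each partner in a risk set has partner covariates X and dyadic covariates W, and the attachment probability is the exponentiated linear index divided by the sum over the risk set. The function name, covariate values, and coefficient values are hypothetical and chosen only for illustration; they are not estimates from the analysis.

```python
import math


def attachment_probabilities(risk_set, beta, gamma):
    """Conditional logit probabilities over one attachment's risk set.

    risk_set -- dict mapping partner label to (x, w): lists of partner and
                dyadic covariate values
    beta     -- coefficients on the partner covariates
    gamma    -- coefficients on the dyadic covariates
    """
    scores = {}
    for partner, (x, w) in risk_set.items():
        scores[partner] = (sum(b * xi for b, xi in zip(beta, x))
                           + sum(g * wi for g, wi in zip(gamma, w)))
    top = max(scores.values())  # subtract the maximum for numerical stability
    weights = {p: math.exp(s - top) for p, s in scores.items()}
    total = sum(weights.values())
    return {p: wgt / total for p, wgt in weights.items()}


# Hypothetical risk set: x = [partner degree], w = [collaborative distance].
risk = {
    "partner_1": ([12.0], [0.5]),
    "partner_2": ([3.0], [0.1]),
    "partner_3": ([3.0], [2.0]),
}
probs = attachment_probabilities(risk, beta=[0.1], gamma=[-1.0])
uniform = attachment_probabilities(risk, beta=[0.0], gamma=[0.0])
```

With all coefficients at zero, every partner in the risk set is equally likely, which corresponds to the simple random attachment against which differential attachment is assessed.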

Second, we explore the extent to which the mechanisms of differential
attachment are contingent on various combinations of (1) characteristics
of the focal, attaching DBF, (2) the form of a partner, (3) the type of
activity involved in each case, and (4) combinations involving a partner's
partners. We do so by interacting variables representing each of these
categories with measures of the attachment bias mechanisms. For
instance, to explore how the focal DBF's attributes (Z) may alter the “rules”


²¹ Although McFadden's conditional logit is commonly known as the Discrete Choice
model, this label describes a theoretical orientation and not an analytical approach.
The model of multiprobability assessment is agnostic as to whether the process of
attachment is calculative, a form of following the herd, conditioned by social structure,
or random. The model is equally applicable to circumstances where DBFs are choosing
partners, the partners select DBFs, the social structure of affiliation matches DBFs
and partners, or any combination thereof. McFadden (2001) initially dubbed his
estimator a conditional logit, but has since indicated he prefers the term multinomial
logit. His model is more flexible than other conditional “fixed effects” estimators, and
more general than standard multinomial estimators, converging to each with correct
specification of dependent and independent variables. Hence, we refer to it more
generally as a multiprobability model.

of attachment, where those rules are specified in terms of partner and
dyadic variables (X, W), we estimate the following specification:


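\[
\Pr\Bigl(y_{ijt} = 1 \;\Big|\; \sum_{m \in J_{it}} y_{imt} = 1,\; X_{jt},\; Z_{it},\; W_{ijt}\Bigr)
= \frac{\exp(\beta X_{jt} + \gamma W_{ijt} + \lambda Z_{it} X_{jt} + \delta Z_{it} W_{ijt})}
       {\sum_{m \in J_{it}} \exp(\beta X_{mt} + \gamma W_{imt} + \lambda Z_{it} X_{mt} + \delta Z_{it} W_{imt})}
\]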
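One way to read this specification, offered as an illustrative gloss rather
than as part of the original analysis, is to collect terms on the partner and
dyadic variables. Writing the interaction coefficients as λ and δ (symbol
names assumed here for illustration), the linear predictor can be rearranged
as

\[
\beta X_{jt} + \gamma W_{ijt} + \lambda Z_{it} X_{jt} + \delta Z_{it} W_{ijt}
= (\beta + \lambda Z_{it})\, X_{jt} + (\gamma + \delta Z_{it})\, W_{ijt}.
\]

Under this reading, the effect of a partner variable X for a focal DBF with
attribute value Z is β + λZ rather than β, which is the sense in which the
focal DBF's attributes alter the rules of attachment. Because Z does not vary
across the alternatives within a risk set, a main effect of Z alone would
cancel from the numerator and denominator, so Z enters only through the
interactions.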


Third, we examine how the sources of attachment bias shift over time
by interacting our measures of the attachment mechanisms with (1) period
effects, as indicated in table 3, and (2) a linear time trend. These measures
give us purchase on how the logics of attachment may change as the
overall structure of the network evolves.

We estimate conditional models of attachment, continually updating
both the risk set of alternative partners and the network measures to
lessen problems of observation dependence common in dyad-level
analyses. Although we define DBFs as our focal population and partners as
alternatives, both DBFs and partners are vying to connect with one
another, and at any point in time a DBF or partner may have a finite capacity
for connections.²²

We estimate all models using Stata 8/SE. For the two-mode analyses,
two features are notable. First, for two-mode new attachments, there are
5,087 events over our time period, with up to 1,600 alternative partners
in any given year. Given the number of variables we tested, the size of
the data set needed to analyze the full population of new two-mode
attachments became cumbersome. Thus, we obtained a random sample of
1,500 two-mode new attachments to form the data on which estimates
were obtained, and for each attachment we took a 25% random sample
of alternative partners. We then repeated the random sampling process
and reestimated the two-mode new attachment models 10 times as a test


²² This limitation introduces a form of competitive interdependence in our observations.
That is, assuming that organizations have some finite need or capacity for partners,
not all firms may be able to attach to all partners they might otherwise have chosen.
This interdependence is substantive, rather than statistical, in that it could, in theory,
be specified and modeled. Such specification would involve many of the variables we
have measured, such as prior degree and experience, but would also require some
variables such as financial resources or managerial skill that are not easily available.
This form of interdependence is, to varying degrees, present in many applications of
McFadden’s estimator. For instance, in the transportation studies (McFadden 1984),
each mode (car, bus, train) was vying for commuters, and not all individuals could
choose the same mode, as doing so would crowd buses or jam highways. As a result,
we do not believe that our results are compromised by this limitation. Further research
into the statistical properties of McFadden's estimator in the face of such
interdependence does, nevertheless, seem warranted.

of robustness. The results were nearly identical for every sample, and
with no sample did the pattern of significance change; thus we present
the initial results, noting that their significance levels would be much
higher for the full two-mode sample.
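The two-stage sampling just described can be sketched as follows. This is an
illustration under stated assumptions, not the procedure actually run in
Stata: the function name, the dictionary keys, and the toy data are
hypothetical, and the sketch assumes that the partner actually chosen is
always kept alongside the sampled alternatives, a detail the text does not
spell out.

```python
import random


def draw_estimation_sample(events, risk_sets, n_events=1500, alt_fraction=0.25, seed=0):
    """Sample events, then sample a fraction of each event's alternatives.

    events    -- list of dicts with hypothetical keys "id" and "partner"
    risk_sets -- dict mapping an event id to the list of partners at risk
    Assumption: the partner actually chosen is always retained.
    """
    rng = random.Random(seed)
    sampled_events = rng.sample(events, min(n_events, len(events)))
    sample = {}
    for event in sampled_events:
        chosen = event["partner"]
        others = [p for p in risk_sets[event["id"]] if p != chosen]
        n_keep = min(len(others), max(1, round(alt_fraction * len(others))))
        sample[event["id"]] = [chosen] + rng.sample(others, n_keep)
    return sample


# Toy data: 20 events, each with the same 40 hypothetical partners at risk.
toy_events = [{"id": e, "partner": e % 40} for e in range(20)]
toy_risk_sets = {e: list(range(40)) for e in range(20)}

# Ten independent replicates, mirroring the repeated-sampling robustness check.
replicates = [
    draw_estimation_sample(toy_events, toy_risk_sets, n_events=5, seed=s)
    for s in range(10)
]
```

Each replicate would then be passed to the estimator, and the coefficients and
significance patterns compared across the ten runs.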

In the two-mode analyses the comparison of organizations from
different sectors (e.g., for-profit corporations, nonprofits, universities,
government agencies, and venture capitalists) requires some comment. A
nonprofit hospital and a venture capital firm are not substitutes for one
another because they provide different resources. Yet, we have included
them as alternatives in the risk set for the same attachments. From a
practical standpoint, in this example the inclusion assumes that a focal
DBF may have discretion as to whether its next alliance will be to obtain
further funding or to seek clinical evidence of efficacy. We tested the
robustness of our findings to this assumption in two ways. First, we
stratified the two-mode risk sets by form, so that only organizations of
the same form as the partner to which the attachment occurred were
included in the risk set. Second, we included interactions of our other
variables with a set of indicator variables for organizational form.
Interestingly, while there was some variation in magnitude of coefficients, the
pattern of direction and significance of predictor variables was constant
across partner forms save for two exceptions: finance ties to venture
capitalists and ties of any type to biomedical supply firms. These exceptions
are not surprising, as venture capitalists are the most specialized and least
multivocal partners (recall figs. 2a-2b, which illustrate that more than
95% of VC collaborations are financial). Other biomedical firms provide
equipment and research tools; thus they represent bilateral exchange
relations, and do not participate in the development of new medical products
in the same, mutual manner as do other participants. Based on these
preliminary analyses, we collapsed the two-mode data across all
organizational forms and included interactions for finance ties to venture
capitalists and ties of any type to biomedical supply firms when exploring
contingencies in the mechanisms for attachment bias.


RESULTS 


We organize the presentation of results around our four hypothesized rules
of attachment: accumulative advantage, homophily, follow-the-trend, and
multiconnectivity. We utilize a number of variables to measure each
mechanism. Thus, assessing the relative strength of a logic of attachment
requires examination of an aggregate pattern of support or falsification. For
our purposes, social mechanisms are more strongly supported when the
bulk of the effects are in the hypothesized directions and those effects are
consistent across all four classes of ties. The full table of results is both
elaborate and complicated; it is presented in appendix B. For the sake of
exposition, we have excerpted the results that are the primary tests of
each hypothesis into a series of four more manageable tables (see tables
4-7 below). To facilitate interpretation, we present the odds ratios, rather
than the coefficients. The odds ratios are obtained from the coefficients
by exponentiation. That is,


\[
\text{odds ratio for } X_k = \exp(\beta_k)
\]


The odds ratio gives the amount by which the odds of an attachment
being biased toward a partner are multiplied for each unit increase in the
level of the independent variable, X, for that partner (odds ratios less
than one reflect negative effects).
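This interpretation follows from the form of the conditional model. As a brief
sketch (the index m, denoting a competing partner in the same risk set, is
introduced here for illustration), the ratio of the attachment probabilities
for partners j and m is

\[
\frac{\Pr(y_{ijt} = 1)}{\Pr(y_{imt} = 1)}
= \exp\bigl(\beta (X_{jt} - X_{mt}) + \gamma (W_{ijt} - W_{imt})\bigr),
\]

because the two probabilities share the same denominator. Raising X for
partner j by one unit, with everything else held fixed, multiplies this ratio
by exp(β). An odds ratio of 1.042, for example, corresponds to a 4.2% increase
in the relative odds per unit, and an odds ratio of 0.947 to a 5.3% decrease.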

Accumulative advantage.—The odds ratios for our tests of accumulative
advantage, hypothesis 1, are presented in table 4. Note that we employ
a narrow reading of accumulative advantage, focusing only on the
structural feature of popularity bias. Odds ratios greater than one for the
variables partner degree, partner experience, prior ties, and prior
experience would offer support for the accumulative advantage mechanism.
For the variable new partner, hypothesis 1 would predict an odds ratio
lower than 1. Overall, the evidence does not favor this hypothesis. Only
the negative impact of partner experience is consistent across all four
categories of ties. Whether the relevant experience resides with the partner
or dyad, these findings indicate a preference for novelty. These results
run sharply counter to an accumulative advantage argument. The pattern
of results for the other measures of accumulative advantage varies by
type of attachment.

For partner degree, we find support for accumulative advantage only
for one-mode new ties. These linkages have a distinctive pattern, as they
are biased to partners with more ties and less time in the network. This
result implies that DBFs seek to balance novelty and visibility when they
form an alliance with another DBF. For each additional tie, the probability
that a new attachment will involve a partner with higher degree increases,
on average, by 4%. The contingencies reported in appendix table B2
indicate that the increase is higher if the attaching DBF is younger and
declines as the focal DBF matures. For each year of a partner’s network
experience, the differential likelihood of attachment decreases by 5%.
These findings suggest that as DBFs age, the salience of partner degree
decreases, and the importance of partner experience recedes as partners
spend more time in the field. Both results emphasize that prospecting for
new partners is common, particularly among older DBFs who opt for
new partners at the expense of more visible ones. In sum, preferential
attachment by degree operates only for new one-mode ties among DBFs.
For all four classes of ties, partners that are more recent entrants are
preferred.

TABLE 4
TEST OF ACCUMULATIVE ADVANTAGE: ODDS RATIOS FROM MCFADDEN’S MODEL

                                     ONE-MODE             TWO-MODE
                     HYPOTHESIS   New       Repeat     New       Repeat
Partner degree          >1        1.042**   .996       .917**    1.007
Partner experience      >1        .947**    .867**     .807**    .865**
Prior ties              >1        NA        1.389**    NA        1.262**
Prior experience        >1        NA        .742**     NA        .862**
New partner             <1        1.169     10.953**   1.662**   .788

NOTE.—Controls for homophily, follow-the-trend, and multiconnectivity were also included in these
models; odds ratios for these mechanisms are presented in subsequent tables. All models include fixed
effects for firm, year, and type, as well as main effects for form. The complete models are presented in
app. B. NA indicates not applicable for new ties.
* P < .05, two-tailed significance.
** P < .01.

Prior ties and prior experience offer dyad-level operationalizations of
accumulative advantage that are relevant for repeat ties only (table 4).
As with partner experience, the effects for prior experience refute the
accumulative advantage hypothesis. A one-year increase in shared
partnership experience lowers the probability of a repeat tie by 26% and 14%
for one- and two-mode attachments, respectively. Prior ties at the dyad
level do fit the hypothesis, however. An increase of one common tie raises
the likelihood of a repeat connection by 39% and 26% for one- and
two-mode attachments, respectively.

Finally, consider the new partner variable. Here an accumulative
advantage hypothesis predicts a negative effect. The results for repeat
one-mode ties and new two-mode ties weigh against the hypothesis. A
partner’s arrival in the network for the first year produces a strong bias for
both subsequent repeat one-mode and new two-mode attachments, increasing
the differential attachment probability elevenfold and by 66%, respectively.
Repeat one-mode affiliations reflect sponsored mobility, where a new DBF is
assisted by another firm. New two-mode alliances are less likely to be
pursued with an already popular partner than with a new arrival,
suggesting these collaborations follow an exploratory logic. Table B2 shows
a contingency for this preference for new two-mode partners to become
particularly strong in the mid- to late 1990s. The fact that this trend
becomes more pronounced over time suggests that rather than being a
consequence of a young field, the importance of novelty in partner
selection is an enduring feature of an industry characterized by rapid technical
advance.

In our dance hall language, when a DBF finds a young (i.e., often newly
entered) DBF dance partner that has created a stir on the dance floor,
the two partners sustain the relationship. New two-mode partners are
also preferred if they are new to the dance floor, but the alliance is less
likely to be renewed. These results suggest that well-positioned, veteran
DBFs spot up-and-coming newcomers and escort them into the network.
In some cases, these new firms are spinoffs from more established DBFs,
which may account for the very strong support for repeat ties with
newcomers. In sum, we find scant support for the hypothesis that repeat
attachments are influenced by the accumulative advantage of the partner.

Homophily.—Table 5 presents the odds ratios for our tests of homophily
(hypothesis 2). We find evidence for homophily if the odds ratios exceed
1 for co-location and governance similarity and if the odds ratios are
lower than 1 for the measures of distance and difference. The results for
homophily vary both within and across classes of attachment. For the
similarity measures (co-location and public/private governance), four of
the odds ratios are significantly supportive, while four run contrary to
the hypothesis. For the distance and difference measures, however, only
three of the odds ratios offer strong support for the hypothesis, with 17
contrary. Overall, homophily holds more strongly for new attachments.
Homophily is less prevalent for one-mode repeat ties, and absent in repeat
two-mode attachments.

In the one-mode case, similarity looms large with regard to geographic
co-location, but not for organizations of similar governance, age, and size.
Firms located close to one another are twice as likely to collaborate, but
similarity in organizational age and size decreases the odds of collaboration.
Thus, DBFs are more likely to attach to nearby DBFs that differ in age
and size. This finding echoes the previous accumulative advantage result,
with larger, veteran DBFs collaborating with smaller newcomers. The
contingencies in table B2 show that the preference for age and size
diversity moderates somewhat in later years. There is weak evidence for
an attachment preference based on similar collaborative profiles and
geographic propinquity in the partner’s neighborhood for new one-mode ties.
For repeat one-mode attachments, diversity rather than similarity is the
rule with respect to collaborative distance.

Homophily receives some support in new two-mode attachments, where
there is a tendency to involve partners whose neighborhood includes other
DBFs with collaborative profiles similar to the focal DBF. A unit decrease
in the average Euclidean distance between the collaborative profiles of
the attaching DBF and other DBFs in the partner's neighborhood
increases the likelihood of attachment by 8%. But the pull of diversity is
also present, particularly with respect to a partner's own collaborative
profile. Thus, the logic of homophily is attenuated by a preference for
diversity in new two-mode partners.

TABLE 5
TEST OF HOMOPHILY: ODDS RATIOS FROM MCFADDEN'S MODEL

                                                        ONE-MODE             TWO-MODE
                                        HYPOTHESIS   New       Repeat     New       Repeat
Co-location                                >1        1.958     2.009**    NA        NA
Partner’s partner co-location              >1        1.294*    .460       1.098     .129**
Governance similarity                      >1        .864      2.037**    NA        NA
Collaborative distance                     <1        .973*     1.053**    1.196**   1.017**
Partner’s partner collaborative distance   <1        .997      1.689**    .920**    1.319**
Size difference                            <1        1.018**   1.001      NA        NA
Partner’s partner size difference          <1        .991      .995**     1.000**   .991
Age difference                             <1        1.036**   1.137**    NA        NA
Partner’s partner age difference           <1        .980      1.063      1.005     1.161**

NOTE.—Controls for accumulative advantage, follow-the-trend, and multiconnectivity were also
included in these models; odds ratios for these mechanisms are presented in prior and subsequent tables.
All models include fixed effects for firm, year, and type, as well as main effects for form. The complete
models are presented in app. B. Size is divided by 100. NA indicates that the two-mode data were not
available for computation.
+ P < .10, two-tailed significance.
* P < .05.
** P < .01.


In the tests of homophily as an attachment rule for repeat two-mode
attachments, there is a distinct pattern of dissimilarity in terms of
demography, location, and collaborative profile between the attaching DBF
and the other DBFs attached to a prospective partner. For example, an
average difference in age of one year increases the probability of
collaboration by 16%. Repeat two-mode attachments are 87% less likely to
occur when the focal DBF is co-located with a partner’s other DBF allies.
As was the case with accumulative advantage, two-mode tests of
homophily also provide strong support for an attachment process driven by
novelty and diversity.

Follow-the-trend.—The odds ratios presented in table 6 test for an
attachment bias based on a logic of appropriateness. Hypothesis 3 is
supported if the odds ratios exceed 1 for both the dominant trend of the
field and the dominant type of the partner. Both variables are measured
in percentage point units. To facilitate interpretation, we use a change of
10% to assess the effects of dominant type and trend. For dominant type,
the odds ratios paint a consistent picture supporting hypothesis 3 for all
classes of alliances. For dominant trend of the field, new and repeat
attachments offer divergent results, with repeat ties following a logic
counter to the overall trend.

TABLE 6
TEST OF FOLLOW-THE-TREND: ODDS RATIOS FROM MCFADDEN'S MODEL

                                  ONE-MODE             TWO-MODE
                  HYPOTHESIS   New       Repeat     New       Repeat
Dominant type        >1        1.027**   1.039**    1.025**   1.029**
Dominant trend       >1        1.251**   .805**     1.064**   .884**

NOTE.—Controls for accumulative advantage, homophily, and multiconnectivity were also included
in these models; odds ratios for these mechanisms are presented in prior and subsequent tables. All models
include fixed effects for firm, year, and type, as well as main effects for form. The complete models are
presented in app. B.
* P < .05, two-tailed significance.
** P < .01.

New one-mode ties manifest a strong tendency for DBFs to follow both
the dominant trend and dominant type of activity. A 10 percentage-point
rise in the dominant type measure increases the differential attachment
probability by 30%. The same rise in the dominant trend measure
increases the differential attachment probability ninefold. The choice to
follow the trend is also apparent in new two-mode ties, both in terms of
the dominant trend in the field and the partner's dominant type. The
conditional attachment probability is boosted by 30% and 85% for each
10% increase in the type and trend measures, respectively. In both classes
of repeat ties, following the trend is apparent for the dominant type of
partner, but there is now a strong preference to buck the field’s overall
trend. With each 10% increase in the trend measure, repeat one-mode
attachments are nearly 90% less likely to occur in the same category, and
repeat two-mode attachments are 70% less likely.
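The 10-point effects quoted above are consistent with compounding the
per-point odds ratios in table 6. The sketch below assumes that this is how
they were obtained, namely by raising each per-point odds ratio to the tenth
power; the text does not state the calculation explicitly, and the function
and variable names are hypothetical.

```python
def scaled_odds_ratio(ratio_per_unit, units):
    """Odds ratio implied by a change of `units` in the covariate."""
    return ratio_per_unit ** units


def percent_change(ratio):
    """Percentage change in the odds implied by an odds ratio."""
    return (ratio - 1.0) * 100.0


# Per-point odds ratios as reported in table 6 (significance markers omitted).
table6 = {
    ("dominant type", "one-mode new"): 1.027,
    ("dominant trend", "one-mode new"): 1.251,
    ("dominant trend", "two-mode new"): 1.064,
    ("dominant trend", "one-mode repeat"): 0.805,
    ("dominant trend", "two-mode repeat"): 0.884,
}

# Compound each per-point ratio over a 10 percentage-point change.
ten_point = {key: scaled_odds_ratio(value, 10) for key, value in table6.items()}
for key, ratio in ten_point.items():
    print(key, round(ratio, 2), f"{percent_change(ratio):+.0f}%")
```

Under this assumption, 1.027 compounds to roughly 1.3 and 1.251 to a little
over 9, in line with the 30% and ninefold figures, while 0.805 and 0.884
compound to roughly 0.11 and 0.29, in line with the "nearly 90%" and "70%"
reductions for repeat ties.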

The follow-the-trend results suggest that “first dances” are conservative,
matching the pattern on the floor, which signals that norms of propriety
condition the logic of attachment. The reverse holds, however, for repeat
ties, where deeper linkages between already connected dyads offer the
occasion to flout the dominant logic. This reinforces findings from tests
of homophily where diversity drove repeat ties. The follow-the-trend
results are the first to show a uniform pattern (all significant at P < .05,
consistent with hypothesis 3) across one- and two-mode ties.

Multiconnectivity.—Table 7 reports tests for the multiconnectivity
hypothesis. Hypothesis 4 predicts positive (> 1) effects for all variables. There
is consistent support for the cohesion and partner tie diversity measures
across all four classes of ties. Six out of the eight possible effects of partner
and shared cohesion are significant (P < .01) in the predicted direction.
Cohesion can be regarded as a strategic window where partners evaluate
a radar screen of possible future partners, as opposed to sampling from
the entire disparate field. The strength of shared cohesion in shaping
two-mode attachments illustrates the value of linkages to multiple affiliates
as a means to validate information. Cohesion entails more than access to
information, however. Resources, skills, access to personnel, and a host
of related benefits are also obtained through partners. The results indicate
that as DBFs scan for potential collaborators, a partner’s diversity of ties
is a valuable marker of resources and information.

TABLE 7
TEST OF MULTICONNECTIVITY: ODDS RATIOS FROM MCFADDEN’S MODEL

                                                ONE-MODE             TWO-MODE
                                HYPOTHESIS   New       Repeat     New       Repeat
Partner cohesion                   >1        1.432**   2.589**    1.667**   1.080
Shared cohesion                    >1        1.058**   1.103      5.272**   1.905**
Partner tie diversity              >1        1.025**   1.016      1.037**   1.020**
Prospective tie diversity          >1        .972      .738**     .883**    .952**
Partner’s partner tie diversity    >1        1.042     .944**     .986**    .959**

NOTE.—Controls for accumulative advantage, homophily, and follow-the-trend were also included
in these models; odds ratios for these mechanisms are presented in prior tables. All models include fixed
effects for firm, year, and type, as well as main effects for form. The complete models are presented in
app. B.
* P < .05, two-tailed significance.
** P < .01.

Both partner and shared cohesion influence new one-mode attachments.
For each additional level of partner connectivity, the differential
probability of a new link jumps by 43%. Similarly, each new independent
pathway connecting a dyad (the increment in shared cohesion) increases
the differential attachment odds by 6%. In addition, new one-mode
affiliations evince a preference for partners that have more varied ties. A
10% increase in partner's tie diversity raises the differential probability
of attachment by 30%.

For repeat one-mode attachments, partner cohesion continues to
provide a positive influence, increasing the conditional attachment odds 2.6
times for each additional level of connectivity. Shared cohesion and
partner tie diversity, from which the DBF already benefits, lose some of their
appeal in repeat tie formation. Further, the coefficients on prospective tie
diversity and partner’s partner tie diversity turn negative. Relatively
typical increases of 1% and 10%, respectively, in these tie diversity measures
lower the differential probability of affiliation by 25% and 45% (see app.
B, table B1, for means and SDs for these measures). Thus, repeat
one-mode attachments serve to reinforce rather than expand prior
collaborative activities. These results suggest a different evaluative metric for
repeat ties, in which the parties forgo increased diversity in favor of more
cohesive (and possibly long-term, and deeper) relations. We lack detailed
data on the content of specific relations, but one might consider that these
participants have come to trust one another more and now are bound
together in a common fate with respect to a new medical product, or that
the information being exchanged is tacit or sticky, and thus more easily
shared in a strong-tie relationship.

Cohesion and partner's tie diversity positively influence attachment for
new two-mode linkages. The effects of cohesion are quite strong: each
increase in the level of cohesion of the partner bumps up the probability
that a collaboration will form with this partner by 66%, while each
additional path of connectivity for the dyad increases the differential
probability of attachment more than fivefold. There is a bias, however, against
partners whose allies have more diverse profiles of collaboration. This
pattern seems to indicate that DBFs favor attachments to partners that
are better positioned with more diverse collaborations, while, ignoring
contingencies, partners prefer attaching to DBFs with more specialized
collaborative profiles. Hence, comparative judgments about partner
diversity reverberate through the network. The negative coefficient of
prospective tie diversity might signal a preference for reinforcing the DBF’s
prior collaborative profile, rather than filling out the DBF's “dance card”
with different forms of partners for diverse types of activities. A
percentage point increase in the index of prospective diversity decreases the
odds of attachment by 12%.

The results for multiconnectivity in repeat two-mode ties demonstrate
that shared cohesion of the dyad and the partner’s tie diversity are both
sources of positive attachment bias. An increase of one pathway of
connectivity between a DBF and a partner enhances the differential
probability of attachment by 90%. A 10% increase in a partner’s tie
diversity boosts the probability of a repeat two-mode tie to the partner
by 20%. The prospective tie diversity of a potential alliance and the
partner’s partner tie diversity are both sources of negative bias, by 4%-5%
for each percentage point increase in the respective index.

The odds ratios for prospective diversity run counter to hypothesis 4
for three of the four classes of attachments (all but new one-mode ties).
These results may indicate that diversity does not represent a goal in and
of itself. When interaction effects are added (see app. B, tables B2 and
B3, cols. 2 and 3, for one- and two-mode results, respectively), the main
effects of prospective tie diversity flip the sign for three of the four classes
of attachments (all but repeat one-mode ties).²³ Hence, the search for


²³ For this pattern to obtain, it is sufficient to add interaction terms for cohesion (shared
cohesion × firm age, firm cohesion × partner cohesion, firm cohesion × prospective
diversity). The change in signs is stable with period and timeline effects added to the
conditional logit model. Changes in the signs of the coefficients do not occur in appendix
tables B2 and B3 for any of the other variables listed in table 7, with one minor
exception. The contingency of combined firm cohesion × prospective diversity, for

diversity is most operative among firms at low levels of structural
cohesion. At higher levels of cohesion, the bias toward diversity recedes in
favor of a preference for partnerships with new entrants. Thus, including
interactions between diversity and cohesion measures indicates a
modification of hypothesis 4. Those DBFs with higher cohesion, having
obtained diversity through indirect k-connectivity, find a diminished need
to increase diversity in new or repeat partners. Cohesion and prospective
tie diversity represent countervailing forces at higher levels. If cohesion
were the most powerful force, the radar screen of possible partners would
become restricted and then ossify. The exploration of diversity precludes
such lock-in.

The process of searching for partners is both dynamic and recursive.
Preferential attachment operates through the attractiveness of shared and
partner cohesion, with firms apparently moving up a ladder in terms of
the cohesiveness of their networks. At the lower rungs of the cohesion
ladder, there is a preference for expanding diversity by linking to
well-connected partners. At the higher rungs of the cohesion ladder, firms may
forgo cohesion, opting to ally with recent entrants to the field. This
relationship is suggestive of a systemic pumping action, with the most
connected members pushing out in a diastolic search to pull in newcomers,
and those less connected being pulled inward, in a systolic action, to attach
to those with more cohesive linkages. The pumping process operates
upwardly from the bottom, level by level.

We illustrate this dynamic process with two figures. We return to Pajek
visualizations, but now we use them to illustrate the econometric results
with respect to the effects of cohesion and diversity. Figure 10 contains
all members of the main component in 1997, with node size scaled to
network degree and color reflecting levels of connectivity. The red nodes
are members of the five component, the cluster with at least five
node-independent paths between all pairs of members, which is a remarkably
cohesive community. The red nodes, not surprisingly, tend to be bunched
at the center of the figure. Green nodes are members of the three and
four components, with blue nodes in the one and two components. Note
the tiny blue nodes at the center of the figure. These are new entrants to
the network, with few connections (reflected in their small size, which
represents low degree), but these nodes have “high quality” links to other
well-connected organizations. This pulling in of newcomers reflects the
process of sponsored mobility we referred to above. Figure 11 portrays
new ties in 1998 among members of the main component. There were
1,121 new ties in 1998, of which 1,074 were forged by members of the


example, predicts lower attachment, but shifts the prediction for prospective diversity
to higher attachment.

main component. Again, node size is scaled to degree in 1997 and color
reflects levels of connectivity. There are but 112 organizations in the most
cohesive component (red), but these organizations are responsible for 59%
of all ties initiated in 1998. Note also that the red nodes are no longer
exclusively clustered in the center but are dispersed. Both the physical
placement of the red and green nodes and their links to a large number
of small blue triangles (representing new entrants to the network in 1998)
reflect the generative role of the most-connected organizations. The
exploratory role of the “elite” red nodes may well account for the strong
confirmation of the follow-the-trend hypothesis. Taken together, the two
figures portray the processes of pulling less connected organizations into
the network and the very active search undertaken by members of the
most cohesive component. (The network images in the online version of
this article include the full sequence of figures for this search process.)

The odds ratios for partner’s partner tie diversity also appear to run
counter to hypothesis 4 for three of the four types of attachments. A
plausible account of these results is that if a potential partner organization
has collaborations with organizations that have even more diversity than
the ostensible target partner, that relationship may be forgone in favor
of a direct tie with the more “attractive,” distant organization (i.e., a
partner's partner, or a friend of a friend). Such an interpretation is contrary
to hypothesis 4, but does suggest a persistence of the preference for
diversity. Note too that such calculations about diversity depend on cohesion
to facilitate recognition of the types of affiliations that are held by various
participants. Overall, the results of table 7 suggest that structural cohesion
matters for the formation of multiple classes of ties. This relationship has
two components. First, DBFs attach to organizations that are also members
of their own component, thereby deepening multiconnectivity in those
clusters. Second, biotechnology firms at the top of the cohesion hierarchy
also reach out to partners located in more distant components.
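
As a reading aid for the odds ratios discussed above, the following is a minimal sketch of how a choice model of this kind turns coefficients into attachment probabilities and odds ratios. It assumes the multiprobability models take the standard conditional-logit form associated with McFadden; the coefficient values, covariate names, and candidate partners are hypothetical and are not taken from the tables.

```python
import math

def choice_probabilities(utilities):
    """Conditional-logit probabilities over one risk set of candidate partners."""
    m = max(utilities)  # subtract the maximum for numerical stability
    weights = [math.exp(u - m) for u in utilities]
    total = sum(weights)
    return [w / total for w in weights]

def odds_ratio(coefficient):
    """Multiplicative change in the odds of attachment per unit of a covariate."""
    return math.exp(coefficient)

# Hypothetical coefficients and covariates, for illustration only.
beta = {"partner_cohesion": 0.5, "co_location": 1.0}
candidates = [
    {"partner_cohesion": 2.0, "co_location": 1.0},
    {"partner_cohesion": 4.0, "co_location": 0.0},
    {"partner_cohesion": 1.0, "co_location": 0.0},
]
utilities = [sum(beta[k] * c[k] for k in beta) for c in candidates]
probs = choice_probabilities(utilities)
print(probs, odds_ratio(beta["co_location"]))
```

An odds ratio above one indicates that the covariate raises the odds that a given candidate is chosen from the risk set; a ratio below one indicates the reverse.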

There are a number of contingencies that add to the overall pattern of
multiconnectivity. First, the older the focal DBF, the less the one-mode
preference for shared cohesion surrounding the dyad. Second, among
DBFs in low-cohesion networks, attachment is tipped toward partners
that increase prospective diversity. The higher the firm cohesion, the
greater the attachment bias from the partner’s cohesion, but the less the
bias from prospective diversity. This latter finding suggests that the
pattern of new attachments among those DBFs in high-cohesion networks
is one of seeking out partners with a diverse portfolio of linkages rather
than allying with other biotechs sequentially to complete the stages of the
drug development process. Put differently, high-cohesion firms are not
trying to balance their dance card of affiliations by having a set of partners
for each functional activity. In contrast, low-cohesion DBFs are trying to
fill out their dance cards, and this tendency looms largest in later years.
Third, the effect of partner cohesion wanes over time, while that of partner
diversity waxes, indicating that the influence of multiconnectivity is
shifting from cohesion to diversity. This shift suggests a change in the topology
of the network, from a power-law distribution toward a more exponential
distribution, with the latter suggesting more distant and random search
for new partners.
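
The contrast between a power-law and an exponential distribution can be illustrated with a rough diagnostic. The sketch below assumes the distributions in question are degree distributions (ties per organization) and compares straight-line fits on log-log axes (power law) and semi-log axes (exponential). The function names are hypothetical, and a comparison of R-squared values is only a crude screen, not a formal test and not the procedure used in this article.

```python
import math

def r_squared(xs, ys):
    """Squared Pearson correlation, i.e. R-squared of a simple linear fit."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy * sxy / (sxx * syy)

def compare_fits(degree_counts):
    """degree_counts maps a degree k >= 1 to the number of nodes with that degree."""
    ks = sorted(k for k, c in degree_counts.items() if k > 0 and c > 0)
    log_counts = [math.log(degree_counts[k]) for k in ks]
    return {
        # A power law is linear when both axes are logged.
        "power_law": r_squared([math.log(k) for k in ks], log_counts),
        # An exponential is linear when only the count axis is logged.
        "exponential": r_squared([float(k) for k in ks], log_counts),
    }

# Synthetic counts that halve with each additional tie (exponential decay).
print(compare_fits({k: 2 ** (8 - k) for k in range(1, 9)}))
```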

The results for the multiconnectivity hypotheses are generally positive,
with nine out of twelve relationships supported significantly for partner
cohesion, shared cohesion, and partner tie diversity. The results turn
negative for prospective tie diversity and partner's partner tie diversity. Upon
examining the contingencies, however, we were led to consider that these
results indicate a more dynamic, reciprocal view of cohesion and diversity,
in which low levels of cohesion trigger a preference for more connected
partners and high levels of cohesion permit exploration in the form of
search for new alliance prospects.


CONCLUSIONS 


The tripartite set of analyses we have presented highlights a network
structure in which multiconnectivity expands as the cast of participants
increases, and, in turn, diversity becomes more important with time. At
this point in the evolution of the field, a combinatorial or multivocal logic
has taken root. Neither money nor market power, nor even the sheer
force of novel ideas, dominates the field. Rather, those organizations with
diverse portfolios of well-connected collaborators are found in the most
cohesive, central positions and have the largest hand in shaping the
evolution of the field. This is a field in which the shadow of the future is
long, as much remains to be learned about the functional aspects of
molecular biology and genomics. The density of the network and the open
scientific trajectory combine to enhance the importance of participants’
reputation. The pattern of cross-cutting collaborations often results in a
partner on one project being a rival on another. The frequent rewiring
of attachments means that participants have to learn how to exit from
relationships gracefully so as not to damage future prospects for affiliation.
In a system where external sources of knowledge and resources are widely
differentiated, a preference for diversity and affiliation with
multiconnected partners has mobilizing consequences.

The three sets of analyses complement one another and provide insight
into the dynamics of affiliation in this field. In the simplest terms, over
the period 1988-99, the primary activities for which organizations engaged
in collaboration shifted from commercialization to finance and R&D.
These two activities reinforce one another, as financial support fuels
expensive research efforts and research progress attracts venture capital. In
turn, as a smaller set of more diverse participants became skilled at
pursuing multiple tasks, a more complex strategy emerged in which some of
the most connected participants engaged in all four types of activity with
a diverse set of partners. The spring-embedded graph drawings show a
change in both the composition of participants and their activities over
time. To return to the analogy of the dance hall, both the music and the
dancers shift over time. The early dances are dominated by large
multinationals and first-generation biotech firms, collaborating to the tune of
commercialization of the lead products of the younger firms, with the
bigger pharmaceutical firms garnering the lion’s share of the revenues.
Research progress, strongly supported by the stable presence of the NIH,
attracts new participants to the dance and also enables incumbent biotech
firms to deepen their product development pipelines and become less
tethered to the giant pharmaceutical companies. Research progress
attracts venture capital funding, which is excited by new possibilities. Thus,
the music changes from commercialization to R&D partnerships and
venture financing. Many of the larger multinationals are pushed to the
periphery and some drop out of the network altogether.

Recall that this dynamic field is growing swiftly over this period, adding
new entrants as progress is made along a broad scientific frontier in which
no single organization can develop a full range of scientific, managerial,
and organizational skills. A diverse set of organizations become cohesive,
central players at the dance: a handful of research universities, key
government agencies, a few elite research hospitals, a larger number of biotech
firms, and a core of multinational giants. The multinationals go through
an era of consolidation, with mergers and acquisitions commonplace, and
their numbers are reduced considerably. Those that survive this period
emerge as multivocal agents, no longer dancing only to the
commercialization beat, but capable of executing R&D partnerships and startup
financing. Venture capital remains at the dance, a critical specialist, but a
somewhat fickle one that is easily attracted to other scenes, such as
information and communications technologies. A small number of
first-generation biotech firms grow into reasonably large organizations in their
own right, and they do so by having learned how to engage in all forms
of collaboration with a heterogeneous set of partners. Here we see strong
affirmation of the finding from previous research that the organizations
did not become "players" by virtue of being larger, but grew larger
precisely because they became central players in the field (Powell et al. 1996).

When we consider the rules that guide attachment in this field, we
observe that no single rule dominates over all time periods. Aside from
the venture capital firms and biomedical supply companies, all of the
participants are engaging in multiple activities, and these different pursuits
are shaped by divergent rules of attachment. The multiprobability models
tease out the mechanisms that undergird preferential attachment. These
results lend the weakest support to the accumulative advantage or
rich-get-richer hypothesis. The visualizations also highlight the importance of
new connections and illustrate that growth in the network is often spurred
either by new entrants or incumbents embarking on new activities. But
the Pajek-generated figures suggest something the logit models cannot
reveal: a very small core of perhaps one or two dozen organizations are
routinely placed in the center, and their node size grows somewhat over
the period. These organizations are the ones with the most diverse array
of collaborations, suggesting that for this handful of central players,
accumulative advantage is fueled by multiconnectivity. Thus, we see a
combination of rich-get-richer and multiconnectivity at work for these core
participants, and they may well set the pace for the dominant trend in
the field. The new tie visualizations do not reveal large nodes at the center,
growing even larger as the field matures. This suggests an open elite,
accessible to novelty as the field expands.

The visualizations are somewhat limited in their ability to capture the
mechanisms of homophily, and here the multiprobability models are much
more effective at showing a preference for collaborating with
organizations that are geographically proximate. If the unit of analysis is the dyad
alone, there is little homophily operating. But when we consider
second-order networks and ask about similarity of a class of possible partners
compared to the chooser’s existing allies, there are tendencies toward
homophily.

Both the visualizations and the multiprobability models illustrate that,
as a strategy for attachment spreads, it is quickly adopted. Whether this
is through learning or imitation, or an indissoluble mixture of the two,
we cannot say. But it is neither costless nor easy to follow the trend. Lots
of ties are broken, and there is considerable exit from the field as well as
the main component. So even if imitation is common, replication is not
easily accomplished.

The most fundamental attachment biases are to multiconnectivity and
diversity, that is, to aligning with varied partners who are more
broadly linked or to new entrants who are sponsored by nodes that are
well placed. This rule is robust in that it created a field with multiple,
node-independent pathways that pulled in promising newcomers, while
pushing out incumbents that failed to keep pace. This logic of attachment
dominated in a period of overall expansion, however. One cannot assume
these processes hold in all environments.

Speculation about whether this network will consolidate, and perhaps
become an obstacle to rather than a catalyst of innovation, generates much
discussion among industry analysts and members of the scientific
community. Clearly, as long as the technological trajectory continues to
generate new discoveries and opportunities, expansion is possible. Unlike
some technologically advanced fields such as computing or
telecommunications, there are few advantages for users in terms of common
standards or brand recognition. As yet, no general purpose technology has
emerged that would speed consolidation. A prolonged inability to raise
money, due to declining research budgets or inhospitable public equity
markets, would threaten undercapitalized firms and might spur
consolidation. The diverse institutional forms (public, private, and nonprofit)
that are active in the field are located in different selection
environments. This diversity offers some protection against unfavorable
economic conditions. Preference for cohesion and diversity may well be
effective search mechanisms for multiple, alternative solutions to problems
in a field where the progress of human know-how has been highly uneven.
Network cohesion and diversity obviously have their limits, though these
are not topics that have been well researched. We have shown in related
work that there are declining returns to network experience (Powell et
al. 1999). Moreover, repeated shocks to any system, whether in the form
of a prolonged downturn in the business cycle, rapid inflation in costs
such as in health care, or highly publicized product failures or corporate
scandals, can destabilize it and result in a tip in the rules of affiliation
and the resulting combinatorial possibilities. We are adding data now on
the period 2000-2003, so we will soon see how stable both the rules of
attachment and the structure of the network are in the face of a period
of economic uncertainty and decline.

Clearly, some aspects of the life sciences are rather idiosyncratic. There
are a wide range of diverse forms of organizations that exert influence
on the development of the field. In many other technologically advanced
industries, universities were critical in early stage discovery efforts, but
as the technology matured, the importance of basic science receded. In
biotech, universities continue to be consequential, and career mobility
back and forth between university and industry is now commonplace
(Owen-Smith and Powell 2001a, 2004). While the institutional
demography of the life sciences may be unusual, the rapid pace of development
and the wide dispersion of centers of knowledge are more typical of other
high-tech fields. The key story, in our view, is less the issue of the nature
and distribution of resources and more how these institutional features
promoted dense webs of connection that, once in place, influenced both
subsequent decisions and the trajectory of the field. With detailed
longitudinal data, we show how the topology of a network emerged, and
generated novelty in an institutional system that has many conservative
elements. The logics of attachment strongly shaped how both new
information and entrants were integrated into the field. The coevolution of
science and commerce was marked by potent micro-macro linkages that
altered the global properties of the field. We have demonstrated how a
preferential bias for collaborating with actors that are either more multiply
or differently linked reshaped the landscape of biotechnology.


APPENDIX A 


Network Visualization in Pajek


Pajek, a freeware program for the analysis of large social networks, offers
new opportunities for generating meaningful and replicable visualizations
of complex network data.²⁴ This appendix offers an overview of the
benefits and limitations of the software, while providing a more detailed
discussion of the steps used to develop the images we presented.

Pajek's strengths and limitations both arise from its emphasis on
visualizing and manipulating very large networks.²⁵ The most obvious
benefit is the significant simplification of the analysis of large-scale network
structures. As the algorithms are optimized for speed, computationally
simple but powerful structural and ego network measures are also
implemented.²⁶ More important for our purposes than such descriptive
measures are Pajek’s capacities to generate replicable images of complex
networks. When turned to the analysis of discrete time “pictures” of the
evolution of a network, Pajek offers the best approximation of dynamic
visualization currently available.

Visualization: Pajek includes a set of network drawing algorithms
based on both graph theoretic conceptions of distance in a network and
the physical theory of random fields (Guyon 1994). These minimum energy
or “spring-embedded” network-drawing algorithms permit a robust
representation of social network data in two-dimensional Euclidean space.²⁷


24 Pajek was developed by Vladimir Batagelj and Andrej Mrvar and is available online
at http://vlado.fmf.uni-lj.si/pub/networks/pajek/. Pajek has been used in a number of
disciplines to represent complex network data (Albert et al. 1999; Batagelj and Mrvar
2000; Moody 2001, 2004; Owen-Smith and Powell 2004; Owen-Smith et al. 2002; White
and Harary 2001). We thank Andrej Mrvar for comments on this appendix.

25 The program will handle networks of up to 1 million nodes. Due to limitations
imposed by the computational capacities of our machines, we have never analyzed a
network larger than 250,000 nodes.

26 Consider simple centrality measures. Pajek calculates degree, closeness, and
betweenness measures for large nets, but does not implement more complex measures such as
information or power centrality.

27 One algorithm also offers the option to visualize a network as a three-dimensional
sphere. While such representations are valuable for some analyses, they raise difficulties
for two-dimensional presentation. Thus, we emphasize two-dimensional optimizations
here.
We draw first on the Fruchterman-Reingold algorithm (FR; Fruchterman
and Reingold 1991), which optimizes network images without reference
to the graph theoretic distance among nodes, to develop initial positions
for all organizations (connected and unconnected). We then turn to a
second algorithm, the Kamada-Kawai (KK; see Kamada and Kawai 1989),
to reposition the connected nodes in the network. Where the FR algorithm
positions all nodes by analogy to a force-directed physical system, the KK
algorithm locates connected nodes adjacent to one another and makes
Euclidean distances among nodes proportional to graph theoretic
distances. Put differently, the KK algorithm visually represents a system
where the distance between nodes is a function of the shortest network
path between them. Taken together, these two algorithms generate
substantively interpretable visual representations of networks, which place
isolated organizations on the periphery of the image, while capturing the
pattern and density of collaborative activity and reflecting the extent to
which these linkages generate meaningful clusters of organizations.²⁸
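
The KK idea that drawn distance should track shortest-path distance can be made concrete with a small sketch. The code below computes graph theoretic distances by breadth-first search and evaluates the spring energy that a Kamada-Kawai style optimizer tries to minimize. It is an illustration written for this appendix's description, not Pajek's implementation; the function names and the `unit_length` parameter are hypothetical.

```python
from collections import deque
import math

def graph_distances(adj, source):
    """Breadth-first shortest-path lengths from source in an unweighted graph."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    return dist

def kk_energy(adj, pos, unit_length=1.0):
    """Spring energy: penalizes mismatch between drawn and graph distance."""
    energy = 0.0
    nodes = sorted(adj)
    for i, u in enumerate(nodes):
        d_u = graph_distances(adj, u)
        for v in nodes[i + 1:]:
            if v not in d_u:
                continue  # distance is undefined across components
            d = d_u[v]
            drawn = math.dist(pos[u], pos[v])
            energy += 0.5 * (drawn - unit_length * d) ** 2 / d ** 2
    return energy

# A three-node path drawn straight (distances match) and bent (they do not).
adj = {"a": ["b"], "b": ["a", "c"], "c": ["b"]}
straight = {"a": (0.0, 0.0), "b": (1.0, 0.0), "c": (2.0, 0.0)}
bent = {"a": (0.0, 0.0), "b": (1.0, 0.0), "c": (1.0, 1.0)}
print(kk_energy(adj, straight), kk_energy(adj, bent))
```

Pairs in different components are skipped because their network distance is undefined, which is why an energy of this kind says nothing about where unconnected nodes or components should sit.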

We find that the two optimization procedures are most effective when
used sequentially. Such a strategy takes advantage of the best features of
both algorithms. Fruchterman-Reingold’s capacity to reposition isolated
nodes and to separate densely connected clusters from one another
provides a useful first step for visualization.²⁹ As it is based on a mathematical
analogy to a physical system, the FR algorithm positions nodes without
reference to the graph theoretic properties of networks that underlie most
common network methods. One consequence of this style of optimization
is a tendency to visually overlay unconnected nodes that share ties to a
common partner. The FR algorithm, for instance, does not effectively
represent the “star” configurations that are commonplace in growing
network structures. Because the KK algorithm takes graph theoretic
distances into account, visualizations generated using it are potentially of
greater substantive interest to social network theorists. We generate the
images presented in this paper using three one-minute-long optimizations.
An FR optimization from random starts is used first and taken as a starting
point for a pair of KK optimizations.
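
The first, force-directed stage can be sketched as follows. This is a deliberately small layout in the spirit of Fruchterman and Reingold (all node pairs repel, tied nodes attract, and a cooling "temperature" caps movement), written under simplifying assumptions: a unit-area frame, constant tie strength, and a fixed number of iterations rather than a timed run. It is not Pajek's code, and the parameter values are hypothetical.

```python
import math
import random

def fruchterman_reingold(nodes, edges, iterations=100, seed=0):
    """Minimal force-directed layout; nodes is a list, edges a list of pairs."""
    rng = random.Random(seed)  # random starting positions
    pos = {v: [rng.random(), rng.random()] for v in nodes}
    k = math.sqrt(1.0 / len(nodes))  # ideal spacing for a unit-area frame
    temp = 0.1                       # caps how far a node may move per step
    for _ in range(iterations):
        disp = {v: [0.0, 0.0] for v in nodes}
        # Every pair of nodes repels, whether or not they are tied.
        for i, u in enumerate(nodes):
            for v in nodes[i + 1:]:
                dx = pos[u][0] - pos[v][0]
                dy = pos[u][1] - pos[v][1]
                d = max(math.hypot(dx, dy), 1e-9)
                f = k * k / d
                disp[u][0] += dx / d * f
                disp[u][1] += dy / d * f
                disp[v][0] -= dx / d * f
                disp[v][1] -= dy / d * f
        # Tied nodes attract along the edge (constant tie strength).
        for u, v in edges:
            dx = pos[u][0] - pos[v][0]
            dy = pos[u][1] - pos[v][1]
            d = max(math.hypot(dx, dy), 1e-9)
            f = d * d / k
            disp[u][0] -= dx / d * f
            disp[u][1] -= dy / d * f
            disp[v][0] += dx / d * f
            disp[v][1] += dy / d * f
        # Move each node, limited by the temperature, then cool.
        for v in nodes:
            d = max(math.hypot(disp[v][0], disp[v][1]), 1e-9)
            step = min(d, temp)
            pos[v][0] += disp[v][0] / d * step
            pos[v][1] += disp[v][1] / d * step
        temp *= 0.95
    return pos

# One tie plus an isolate; the isolate is positioned by repulsion alone.
layout = fruchterman_reingold(["a", "b", "isolate"], [("a", "b")])
print(layout)
```

Because every node, tied or not, receives a position from the repulsive forces, a pass of this kind can place isolates and separate clusters before a distance-based refinement is applied to the connected nodes.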

28 While the repelling forces of nodes are determined by a constant (though
manipulable) factor, the attractive strength of ties can vary with the observed value of ties.
In this paper, our images are created with constant tie strengths.

29 As implemented in Pajek, the Kamada-Kawai algorithm (KK) only alters the position
of connected nodes as network distances among unconnected portions of a graph are
undefined. By the same token, KK tends to overlap unconnected components, making
the visual analysis of network evolution difficult.

A lexicon for Pajek visualizations: Pajek's visual flexibility offers
several options to convey complex information through images. Consider our
figures, which draw upon a simple “lexicon” of four parameters: position,
color, size, and shape. We use position (the outcome of FR and KK
optimizations) to discuss the clustering of organizations in the macronetwork.
We use color to distinguish among forms of organizations and the types
of activities they are involved in. We turn to node size, scaled to reflect
standardized network degree, to communicate variations in connectedness
across organizational forms and to suggest the effect of prior network
degree on the propensity to form new ties. In principle, node size can be
scaled to reflect variation on any real number variable and need not be
limited to information generated by Pajek. For coordinating statistical
analyses with the visualization, it is useful to scale node size to reflect
differential levels of network measures of node attributes that provide
explanatory purchase.
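
Scaling node size to degree can be sketched in a few lines. The example below assumes that "standardized" degree means degree divided by its maximum possible value (n - 1), which is one common normalization and may differ from the one used for the figures; the function name and the size range are hypothetical.

```python
def node_sizes(nodes, edges, min_size=1.0, max_size=10.0):
    """Map each node's standardized degree onto a drawing size."""
    degree = {v: 0 for v in nodes}
    for u, v in edges:
        degree[u] += 1
        degree[v] += 1
    n = len(nodes)
    sizes = {}
    for v in nodes:
        # Assumed normalization: share of the n - 1 possible ties.
        standardized = degree[v] / (n - 1) if n > 1 else 0.0
        sizes[v] = min_size + (max_size - min_size) * standardized
    return sizes

# A hub with three partners and one isolate.
sizes = node_sizes(["hub", "x", "y", "z", "iso"],
                   [("hub", "x"), ("hub", "y"), ("hub", "z")])
print(sizes)
```

The same mapping applies to any real-valued node attribute in place of degree, which is the flexibility the text describes.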

In figures 5, 7, and 9, shape expresses the distinction between incumbent
(circle) and entrant (triangle) organizations, while maintaining the same
color variations across organizational form. The shape distinction, then,
along with tie and node color and node position, visually suggests
differences in the attachment profiles of entrants and incumbents across
organizational forms. Taken together, these four parameters offer a wide
range of possibilities for exploring data and conveying complex results in
network visualizations.

Limitations: Despite the many novel and useful features of Pajek,
there are also limitations. Pajek cannot perform most statistical tests of
significance, and cannot be used to calculate some commonly used
network measures. More importantly, features of the KK and FR
optimization procedures bear upon the interpretation of node position and the
comparability and replicability of images. The key difficulty lies in the
algorithms’ probabilistic optimization procedures. The danger of such
procedures is that they may, in repeated trials, converge on different local
minima rather than finding a global minimum energy by which to position
nodes. As a result, the position of extreme outlying nodes can shift from
optimization to optimization. This variability occurs most commonly in
the location of isolates and nodes with single connections.³⁰ The relative
position of nodes toward the center of a given network is vastly more
stable and reliable. Likewise, the relative position of different components
as positioned in the FR algorithm may vary.
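
A stability check of the kind described here, in which the same network is laid out repeatedly from random starts and the spread of each node's coordinates is compared, might look like the sketch below. The `toy_layout` function is a hypothetical stand-in for a real optimizer, used only so the example runs on its own. Comparing raw coordinates also assumes the runs share a common frame; rotations or reflections between runs would need to be removed first.

```python
import random
import statistics

def position_variance(layout_fn, nodes, runs=30):
    """Summed variance of each node's x and y coordinates across repeated runs."""
    xs = {v: [] for v in nodes}
    ys = {v: [] for v in nodes}
    for seed in range(runs):
        pos = layout_fn(seed)  # one optimization from its own random start
        for v in nodes:
            xs[v].append(pos[v][0])
            ys[v].append(pos[v][1])
    return {v: statistics.pvariance(xs[v]) + statistics.pvariance(ys[v])
            for v in nodes}

def toy_layout(seed):
    """Hypothetical stand-in optimizer: a steady core node, a drifting isolate."""
    rng = random.Random(seed)
    isolate = (rng.random(), rng.random())
    core = (0.5 + rng.uniform(-0.01, 0.01), 0.5 + rng.uniform(-0.01, 0.01))
    return {"core": core, "isolate": isolate}

variances = position_variance(toy_layout, ["core", "isolate"])
print(variances)
```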

This limitation has several substantive implications. First, standards
for the robustness of optimizations cannot be couched in terms of the
exact Euclidean position of nodes. We rely instead on the aggregate
features of the network and the position of core organizations (note, e.g., our
use of the NIH, a very well connected node, as a touchstone in the
pullout presented in fig. 4). More important, the position of nodes cannot be
understood in absolute terms. Instead, node position must be interpreted
relative to the position of other nodes in the network. The emphasis on
relative position does make it meaningful to consider the coincidence of
nodes in close proximity across images.³¹

30 To test this variation we optimized the same network multiple (30) times from random
starting points and took the mean and variance of node coordinates, finding the least
variation in the position of well-connected (high degree) nodes.

31 It is possible to generate more strictly comparable images by fixing the position of
nodes across visualizations. Nevertheless, choosing an analytically appropriate constant
position raises a new set of issues, and such a strategy limits the package’s visual
flexibility by fixing one of the four parameters that can convey information.
TABLE B1
DESCRIPTIVE STATISTICS 








VARIABLE 


Accum advantage 
Firm degree 
Partner degree 
Firm experience 
New partner 


Partner experience 


Prior ties 
Prior experience 
Homophily 
Collab distance 
Age difference 
Size difference 
Gov similarity 
Co-location 
Partner’s partner 
Collab distance 
Age difference 
Size difference 
Gov similarity
Co-location 
Follow-the-trend 
Dominant trend 
Dominant type 
Multiconnectivity 
Firm cohesion 
Partner cohesion 
Shared cohesion 
Tie diversity 
Firm 
Partner 
Prospective 


Partner’s partner 


Controls 
Type research 
Finance 
Commercial
Licensing 

Form biomedical 
Univ /nonprofit 
Government 
Venture capital 
Pharmaceutical 

Age

Size 

Governance 


ONE-MODE ATTACHMENTS 


New 
Mean SD 

1836 1466 
701 7 94 
7 80 4 38 
57 50 
5 69 449 
8 74 5 87 
6 32 5 23 
581 1,215 
51 50 
11 34 
438 658 
249 439 
407 1,131 
86 31 
21 38 
04 02 
20 27 
424 111 
242 186 
1 24 1 67 
73 18 
47 34 
77 12 
32 41 
31 46 
16 37 
27 44 
26 44 
971 561 
569 1,200 
77 42 


Repeat 
Mean SD 
4110 1869 
1705 1393 
12 24 3 98 

10 30 

8 57 4 05 
6 10 4 18 
4 84 2 79 
15 72 7 05 
6 99 521 
2,437 2,100 
77 42 
27 44 

9 27 6 95 
3 76 415 
1,425 1,794 

85 23 

56 38 

04 02 

27 22 

4 85 74 
411 1 20 
2 86 2 06 

85 07 

75 14 

85 06 

79 26 

21 41 

28 45 

25 43 

25 44 
14 93 538 
2,588 2,165 

95 21 


TWO-MODE ATTACHMENTS


New 

Mean SD 
1446 1080 
99 2 49 
6 22 3 76 
37 48 
3 36 351 
8 83 747 
447 6 45 
2 46 4 02 
152 631 
82 37 
24 41 
07 09 
13 32 
3 88 1 22 
74 111 
69 101 
66 21 
05 16 
71 15 
34 39 
22 42 
47 50 
18 38 
13 34 
25 43 
15 36 
04 20 
33 47 
23 42 
7 67 457 
192 584 
68 47 


Repeat 
Mean SD 
23 23 15 79 

741 1120 
9 36 3 96 
06 24 
562 3 26 
581 4 33 
4 95 3 56 
1480 1107 
6 92 6 14 
259 3 50 
399 1,175 

83 24 

59 38 

10 11 

17 55 

4 49 1 00 
278 184 
2 67 176 

77 12 

27 30 

77 12 

73 28 

33 47 

27 44 

24 43 

16 37 

10 29 

15 35 

10 30 

25 44 

40 49 
11 10 5 14 

681 1,449 

88 33 





TABLE B2 
ONE-MODE ATTACHMENTS: MCFADDEN MULTIPROBABILITY MODEL ESTIMATES





NEW ATTACHMENTS REPEAT ATTACHMENTS 


(1) (2) (3) (4) (5) 


Independent variables 
Accumulative advantage
Partner degree 04127” 06187” 0603** — 0037 0014 
( 0058) ( 0085) ( 0086) (0107) (0151) 
New Partner 1558 1463 1550 2 3936** 2 4770** 
( 1177) ( 1172) ( 1172) ( 5917) ( 6080) 
— 1429** — 1529** 
Partner experience — 0537** —.0474** — 0477** 
(0128) ( 0128) (0130) (0420) (0447) 
Prior ties 3286** 3799** 
(0556) ( 0602) 
Prior experience — 2984** — 3632** 
(0947) ( 1027) 
Homophily 
Collaborative distance — 0270* — 0297* — 0262* 0513* 0684* 
(0142) (0149) ( 0153) ( 0293) ( 0311) 
Age difference 035377 0336** 0492** 1288** 11177" 
( 0088) ( 0097) ( 0115) ( 0337) (0344) 
Size difference 0182** 0163** 0303** 0011 0012 
( 0045) ( 0046) ( 0062) (0011) ( 0012) 
Governance similarity — 1458 — 1238 — 1323 7119* 4917 
( 0892) ( 0901) ( 0906) ( 3909) ( 4140) 
Co-location 6681** 6576** 6566** 7379* 8580** 
(0947) ( 0952) ( 0954) (2891) ( 2960) 
Partner's partner 
Collaborative distance — 0027 — 0023 — 0021 5245** 5079** 
(0144) (0145) (0146) (0584) (0601) 
Age dıfference — 0192 — 0116 0830* 0612 0893* 
(0118) (0119) (0331) (0409) (0437) 
Sıze dıfference — 0089 — 0083 — 0504* — 00527” — 0425” 
( 0056) ( 0056) ( 0247) ( 0018) ( 0188) 
Governance sımılarıty — 0237 — 0143 — 0160 6094 101797 
( 1429) ( 1443) ( 1450) ( 5605) (5770) 
Co-locatıon 25797 — 23747 2210” — 7763 — 9146” 
( 1334) ( 1340) ( 1343) ( 4950) (5146) 
Follow-the-trend 
Dominant trend 224007** 164288** Oo 153775** —215871*  —250997** 
(2 4351) (3 3135) (3 3216) (10 0363) (10 3612) 
Domunant type 27150**  21221** 2 1213”” 3 8823** 3 9045** 
( 1515) ( 1617) ( 1632) ( 6569) (6931) 
Multiconnectivity 
Partner cohesion 3594** 0419 0543 9512** 1 0824** 
( 0457) ( 1481) ( 1466) (1919) (2093) 
Shared cohesion 0566* 1518** 1458** 0978 0783 
(0271) (0535) (0535) (0799) ( 0824) 
Partner tie diversity 20492** 2 8440** 2 1283” 15821 1 9676 
(2785) (9773) (1 0408) (1 4550) (1 4345) 
Prospective tie diversity —2 7991 20 25027” 82 7609** —303778** —295611** 
(15170) (5 6211) (214090) (106104) (105284) 
Partner’s partner tie 
diversity 4118 1 0324* 8897 —58047** | —68265** 
( 4286) ( 6031) (6018) (1 4601) (1 6196) 
NEW ATTACHMENTS REPEAT ATTACHMENTS
(1) (2) (3) (4) (5) 
Contingencies *
Firm age X partner degree — 0021** — 0021** 
( 0005) ( 0006) 
Firm age x shared cohesion — 0107* — 0103* 
( 0048) ( 0048) 
Firm cohesion x partner cohe- 
sion 0975** 1275** 
( 0346) (0357) 


Firm cohesion x prospective di- 
versity —8 9460** —27 6005** 


(2 0969) (6 4403) 
Period and timeline effects 


Period 3 x age 


difference — 0423* 
( 0186) 
Period 3 x size 
difference — 0184** 
( 0068) 
Period 3 x partner 
cohesion — 2068* 
( 0835) 
Period 3 x partner 
diversity 4 0306* 
(1 9870) 
Period 3 x prospective 
diversity —69 9582** 
(22 3349) 
Period 3 x firm cohesion x 
prospective diversity 214625** 
(6 9130) 
Timeline x partner’s partner 
age difference — 0106** 
( 0037) 
Timeline x partner's partner 
size difference 0044* 
( 0024) 
LRx'(df) 1,498 1,692 1,767 498 522 
(21) (27) (35) (23) (25) 
Pseudo-R’ 17 19 21 60 64 
N 966 373 





NOTE: All models include fixed effects for firm, year, and type. Measure of firm size is divided by
100.

* Contingencies for activity type were included in models 2 and 3. They are available from Kenneth
Koput.

* P < .10

* P < .05

** P < .01
TABLE B3


TWO-MODE ATTACHMENTS: MCFADDEN MULTIPROBABILITY MODEL ESTIMATES








NEW ATTACHMENTS


REPEAT ATTACHMENTS 





(1) (2) (3) (4) (5) (6) 
Independent variables 
Accumulative advantage 
Partner degree — 0859** 0128 0142 0072 0122 0062 
( 0100) ( 0100) ( 0106) ( 0079) ( 0081) ( 0082) 
New partner 5081** 7974** 6150** — 2385 — 2313 — 2323 
( 1167) (1350) ( 1669) (5891) (5772) (5799) 
Partner experience — 2140** — 159578 — 1381** — 145977 = 15276** — 1819** 
( 0150) (0173) (0179) ( 0366) ( 0369) ( 0379) 
Prior ties 2327** 2448** 2470** 
(0162) (0168) ( 0168) 
Prior Experience — 1480”” 0921 1108 
( 0265) (0717) ( 0726) 
Homophily 
Collaborative distance 1794** 0386** 0449** 0168* 0120 0188* 
(0124) (0126) (0127) (0100) (0102) (0104) 
Partner's partner
Tie distance — 0836** — 0089 — 0083 2766** 2806** 3077** 
(0124) (0124) (0124) (0221) ( 0234) ( 0248) 
Age difference 0048 0435** 0430** 1497** 152579 1498** 
(0133) ( 0163) ( 0162) ( 0229) ( 0235) ( 0237) 
Size difference 0033* 0441** 0415** — 0088 — 0048 — 0139 
( 0013) (0122) ( 0123) (0157) (0157) ( 0162) 
Governance similarity 2690* 2316 2685 — 6362** — 6019* — 6873** 
( 1232) ( 1898) ( 1903) (2451) ( 2536) ( 2550) 
Co-location 0938 — 0934 — 1475 —2 0462** —2 1407** —2 1807** 
( 1327) (1763) ( 1765) ( 2758) (2857) ( 2870) 
Follow-the-trend 
Dominant trend 6 2637** 3 8608** —3 7647** —12 2855** —12 1924** —12 3164** 
(4811) (10541) (1 0656) (1 1006) (13070) (13330) 
Dominant type 2 5418** 1 5736** 1 5962** 2 9147** 3 0110** 3 0718** 
Multiconnectivity
Partner cohesion 


Shared cohesion


Partner tie diversity 


Prospective tie diversity 


Partner’s partner tie diversity 


Contingencies * 


Firm cohesion X prior experience 

Firm cohesion x partner cohesion 

Firm cohesion x shared cohesion 

Firm cohesion x partner tie diversity 

Firm cohesion x prospective diversity 

Firm cohesion x partner’s partner tie diversity 
Firm tie diversity x partner tie diversity 

Firm tie diversity x prospective diversity 


Firm tie diversity X partner’s partner tie diversity 


Period and timeline effects 
Period 3 x new partner 


Period 3 x prospective diversity 
Period 3 x firm cohesion x partner diversity 


Period 3 x firm cohesion x prospective diversity 


( 1288) 


5114** 
(0622) 
16624** 
(0701) 
3 6399** 
(2189) 

—12 4229** 
( 6607) 
—13459** 
( 3385) 


(1533) 


2 1824** 
( 1907) 
4 3605** 
( 2038) 
1 0723 
( 7584) 
4 8200** 
(16773) 
—8 0500** 
(7161) 


— 64294* 
(0654) 
— 1650** 
( 0624) 

— 1523 
( 1834) 
—5 1134** 
( 8105) 

9610** 
( 1631) 
—2 4258** 
( 9173) 
—13 3864** 
(2 6567) 
—19205** 
( 7013) 


( 1541) 


3 29161** 
(5708) 

4 423** 

( 2183) 

2 9030** 
( 2002) 

1 1967 
(18764) 
—8 3912** 
(7372) 


—11419** 
(2234) 
— 1655“ 
( 0648) 
— 9175** 
( 2418) 
—3 3183** 
( 9022) 
1 0233** 
( 1673) 
—1 57867 
(9318) 
—17 1266** 
(2 9355) 
—1 7864* 
( 7050) 


5656* 

( 2230) 
11 7024** 
(4 0933) 
1 2202** 

( 3217) 
—6 0453** 


( 2070) (2136) 
0771 3877* 
(1064) (1871) 
6444** 1 0000** 
( 1239) ( 1889) 
1 9538** 1 8883** 
( 3255) ( 3400) 
—4 8813** 7 87167 
(1 6806) (4 2148) 
—4 1571” — 9626 
(8235) (2 3109) 
— 0607** 
(0162) 
— 1186” 
(0572) 
—22 3118** 
(7 7478) 
—7 0456* 
(3 1050) 


( 2169) 


1 5000 
(7250) 
2 1678** 
(5974) 
—1303* 
(7559) 
9 9173* 
(4 6065) 
—5 2848* 
(2 7551) 


— 0641** 
(0163) 
— 4833* 
( 2400) 


—26 5231** 
(8 3263) 

—7 5804* 
(3 2204) 


O0cT 


(1 864 


Period 3 x firm diversity x prospective diversity 14 27C 
(6 972 
Timeline x partner cohesion — 136 
(061 
Timeline x firm cohesion x partner cohesion 057 
(023 


Timeline x shared cohesion 
Timeline x partner tie diversity 


Timeline x partner’s partner tie diversity 


LR x (df 5,970 8,893 9,05 

(17) (40) (48) 
Pseudo-R? 32 49 51 
N 1,500 


NOTE. All models include fixed effects for firm, year, and type, as well as main effects for form Meg 

* Form and type contingencies were also included in the model. They are available from Kenneth Koy 
* P < .10 

* P < .05 

** P < .01 
Increased interest in longitudinal social networks and the recognition 
that visualization fosters theoretical insight create a need for dynamic 
network visualizations, or network “movies.” This article confronts 
theoretical questions surrounding the temporal representations of social 
networks and technical questions about how best to link network change 
to changes in the graphical representation. The authors divide network 
movies into (1) static flip books, where node position remains constant 
but edges cumulate over time, and (2) dynamic movies, where nodes move 
as a function of changes in relations. Flip books are particularly useful 
in contexts where relations are sparse. For more connected networks, 
movies are often more appropriate. Three empirical examples demonstrate 
the advantages of different movie styles. A new software program for 
creating network movies is discussed in the appendix. 


INTRODUCTION 


Ranging from simple histograms to dynamic images of the birth of galaxies 
(Abel, Bryan, and Norman 2000), visualization tools have always been 
key elements in scientific advancement. The ability to see data clearly 
creates a capacity for building intuition that is unsurpassed by summary 
statistics. Wide ranges of distributional shapes (Handcock and Morris 
1999), nonlinear relations, or spatial (geographic or social) proximity can 
be easily summarized with an image that helps scientists develop theory. 
While the basic principle of substantive clarity is key to successful data 
visualization, work in this field is often as much art as science. To build 
network visualization tools, we need to examine carefully questions about 
the meaning and implication of time in the formation of social networks. 
In this article, we hope to extend a bit more science into the art of dynamic 
network visualizations and further the theoretical discussion about social 
network dynamics. 

Social network research has made extensive use of visualization since 
Moreno first introduced the sociogram (Brandes, Raab, and Wagner 2001; 
Freeman 2000a, 2000b; Freeman, Webster, and Kirke 1998). Actors are 
usually represented as points, and relations among actors are represented 
by lines, with relational direction indicated by arrows. Early sociograms 
were drawn by hand (Whyte 1943; Coleman 1961), and the layout was 
determined by the artistic and analytic eye of the author. Such early graphs 
were usually simple, having few relations per person or a clear hierarchical 
structure. 

The state of the art has progressed remarkably since Whyte and Coleman, 
and a growing body of research has developed around various 
definitions for optimal network layout (Brandes et al. 2001; Freeman 
2000b; McGrath et al. 1997). Most network images do a poor job of 
representing change in networks, and researchers make do by presenting 
successive snapshots of the network over time (Bearman and Everett 1993; 
Powell et al. 2005; Roy 1983). The problem is fundamental to the media. 
To effectively display the relational structure of a social network, at least 
two dimensions are needed to represent proximity, and that leaves no 
effective space (on a printed page) to represent time.² However, recent 
media advances allow us to use space to represent social distance and 
movement to represent change over time (Bender-deMoll and McFarland 
2002; Freeman 2000a). 


² There have been a number of creative attempts to overcome this limitation, such as 
producing multiple images of the network and placing them “next” to each other 
temporally (see, e.g., http://www.stanford.edu/group/esrg/siliconvalley/docs/networkmemo.pdf). 
These approaches require a fair amount of reader training to see the 
difference between each plotted time frame. The less-than-intuitive results follow 
because these approaches are attempting to replicate a dynamic process in a static 
medium. Our approach is to use the dynamic nature of such networks directly, thereby 
producing a more readily interpretable visualization. 
RESEARCH ON LONGITUDINAL SOCIAL NETWORKS 


Much of the recent interest in longitudinal social networks revolves 
around understanding how networks develop and change, as scientists 
seek to build models of social processes that result in observed structures 
(Doreian et al. 1996; Leenders 1996; Nakao and Romney 1993; Snijders 
1998; Suitor et al. 1997; Weesie and Flap 1990; Zeggelink et al. 1996). 
Such dynamics are important for understanding network stability, which 
is necessary for understanding the effect of networks on individual and 
group behavior. The clear importance of such questions has prompted a 
good deal of methodological research on network change. For example, 
Snijders (1996, 1998, 2001) has developed models for evaluating the extent 
of change in a social network, conditional on structural features of the 
graph. 

The purpose of dynamic network visualizations is to help augment 
theoretical intuition provided by summary statistics and standard static 
visualizations. Until now, visualizations of network change have tended 
to take two forms. The first common visualization approach plots network 
summary statistics as line graphs over time. For example, Doreian et al. 
(1996) present change in reciprocity and transitivity for the Newcomb 
data (see also Gould 2002). However, such summary statistics provide 
information on a single dimension of a network’s structure. For example, 
one might find that a network reaches a given equilibrium transitivity 
level, but since transitivity is a single average for the graph as a whole, 
we cannot know if this, in itself, means the graph is now relationally 
stable. The second common visualization approach is to examine separate 
images of the network at each point in time. Unfortunately, such images 
are often difficult to interpret, since it is impossible to identify the sequence 
linking node position in one frame to position in the next. 


THEORETICAL IMPLICATIONS OF NETWORK DYNAMICS 


A standing critique of social network research has focused on a “structural 
bias” that implicitly denies much of the dynamic nature of social relations 
(Emirbayer 1997; Emirbayer and Goodwin 1994). For some types of 
relations (such as conversations that occur in real time), one could argue 
that the networks are largely artificial constructions built by aggregating 
dead past events. The network “structure” as such only emerges from this 
aggregation. While we do not think this argument should be pushed too 
far, it raises important questions about how the temporal embeddedness 
of relations defines a dynamic social space. While discussions of meaning 
and temporal abstraction in themselves are not new (Abbott 1997; Bearman 
et al. 1999; Danto 1985), our goal is to identify a meaningful way 
to capture network dynamics empirically that simultaneously allows for 
the importance of past relations typically captured in static network 
images. 


Basic Terms 


Much research on social networks is filled with static nouns, such as 
“roles,” “relations,” “obligations,” and so forth. Longitudinal research on 
social networks requires a different set of process terms, such as “ritual,” 
“dance,” “pulse,” “tempo,” “congealing,” or “dispersal.” The key distinction 
is that an apparently static network pattern often emerges through a set 
of temporal interactions, with important implications for the relational 
process under investigation. For example, when viewed in continuous 
time, networks may develop by spurts or build slowly and steadily, or 
they may reflect repeated ritual behaviors that mix moments of order and 
chaos. 

The most basic dimension for dynamic relations is relational pace. 
These are questions concerning the rate of change in relations, with 
particular interest in irregularities. The pace of relation formation can thus 
be described with respect to levels (fast, slow), change (accelerating, 
decelerating), or stability (cascades, jumps and starts, etc.). Clearly, the exact 
meaning of such terms depends on context, since the relevant scale will 
vary across types of relations. Think, for example, of seconds for 
conversation networks, weeks for friendship networks, decades for world 
trade networks, and generations for kinship networks (Nadel 1955; Collins 
1981). 

A second specifically temporal aspect of relations revolves around the 
order of relations, or their sequence. In many circumstances, being able 
to explain the prevalence of given structures depends on identifying the 
order in which relations occur. For example, the distinguishing 
characteristics of Johnsen’s (1985, 1986) process agreement models for friendship 
formation depend on whether one first forms reciprocal friends, which 
then generates agreement on third parties and creates transitive ties, or 
if two people first agree on their admiration of a third, which draws them 
together in friendship. Similarly, Chase’s (1980) explanation for the 
development of hierarchy in social relations rests on a particular order of 
relationship unfolding. 

A special case of the intersection of pace and sequence is found in 
notions of concurrency developed for disease networks (Kretzschmar and 
Morris 1996; Morris 1993). Two relations are concurrent if they overlap 
in time and share at least one person. While static views of networks 
focus on multiplicity (the overlap of types of relations), concurrency is 
effectively “temporal multiplicity,” which can dramatically complicate our 
understanding of relations. Moody (2002) shows that the temporal 
unfolding of relations reduces reachability, which has implications for any 
measures on networks with relations enacted in real time. Consider, for 
example, the different disease implications of a relational “switching” 
model, where actors retain past relations until they are secure in new 
relations, leading to a sequence of short-duration overlaps, and a multiple 
partner model, where people overlap for extended periods of time with 
multiple partners (Laumann and Youm 1999). 
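
The definition of concurrency given above can be made concrete with a small sketch. The Python below is only an illustration: the tuple layout `(person_a, person_b, start, end)`, the function name `concurrent_pairs`, and the example relations are all hypothetical and are not drawn from any of the cited studies.

```python
from itertools import combinations


def concurrent_pairs(relations):
    """Return pairs of relations that overlap in time and share a person.

    Each relation is a hypothetical tuple (person_a, person_b, start, end).
    """
    pairs = []
    for r1, r2 in combinations(relations, 2):
        shares_person = bool({r1[0], r1[1]} & {r2[0], r2[1]})
        # Two intervals overlap when each one starts before the other ends.
        overlaps = r1[2] < r2[3] and r2[2] < r1[3]
        if shares_person and overlaps:
            pairs.append((r1, r2))
    return pairs


# Illustrative data only.
relations = [
    ("A", "B", 0, 10),
    ("A", "C", 8, 12),   # overlaps A-B and shares A: concurrent
    ("B", "D", 11, 15),  # shares B with A-B but starts after it ends
]

for r1, r2 in concurrent_pairs(relations):
    print(r1, "is concurrent with", r2)
```

Under this reading, a "switching" pattern would show up as a few short overlaps, while a multiple partner pattern would show up as long ones; measuring overlap length would be a natural extension.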

Finally, the richness of a relational structure further expands when we 
link relational timing to types of tie. When we allow simultaneous 
consideration of the pace and sequence of ties with variation across types of 
ties, we can start to tap questions of how sets of interaction sequences 
transform into stable relations (see Nadel 1955). For example, if theoretical 
discussions of local action (Leifer 1988; Padgett and Ansell 1993) are 
correct, then we should be able to map social interaction as a sequence 
of seemingly random “milling around” that quickly cascades once role 
positions are fixed. This type of rich network visualization provides a 
more holistic understanding of the network’s structure than any 
single-dimensional index could. 

These are just a small number of the theoretical considerations evident 
when one takes time seriously in the modeling and understanding of social 
networks, and we expect that others will contribute much to this 
discussion. Our belief is that any such discussions are best grounded in empirical 
investigation (Abbott 2000), which will be enhanced with a set of flexible 
tools for displaying dynamic network data. 


REPRESENTING LONGITUDINAL NETWORK DATA 


To develop dynamic network images, we need to conceptualize clearly 
how time is encoded in social networks. We conceive of time in two 
analytically distinct forms: discrete and continuous. Discrete renditions 
of time consist of cross-sectional snapshots of the network. Hence, 
longitudinal analysis focuses on the change from one network state to another 
without any (explicit) reference to the sequence of changes that generate 
change. In such cases, a process is generally inferred from the total 
network change across time. Due to research costs and design, most 
longitudinal network studies use discrete time. Continuous renditions of time 
consist of streaming relational events or interactions recorded with exact 
starting and ending times. Streaming relational events consist of sequential 
dyadic events or interactions whose visual representation should unfold 
as a continuous social process. Continuous representations of time enable 
researchers to identify how overall network changes emerge through 
ordered dyadic events. But researchers need not analyze the data in its 
most reductive form, as they can aggregate relational events into larger 
time units (such as minutes instead of seconds). Hence, it is possible to 
develop discrete notions of time from continuous data but not vice versa. 

If researchers have many panels of network data or continuous 
representations of network change, they have several decisions to make before 
they represent the data visually. First, what constitutes their networks? 
With discrete waves of sociometric surveys, each wave becomes the 
network used in graphic representation. However, with continuous data it 
becomes more difficult to define a network’s temporal boundary 
(Laumann, Marsden, and Prensky 1983).³ Is the network defined by 10 
relational events, all the events in one minute, in 10 minutes, or in a single 
day? The answer to this may depend on the empirical focus of the 
researcher, but what is undeniable is that we cannot select each dyadic 
event as a network. A network consists of a pattern of social relations, 
and therefore we must identify chunks of time that substantively capture 
the nature of relational events and the character of temporary networks 
that arise in the focal context.⁴ 

³ One could argue that Laumann et al.’s (1983) discussion of nominal and realist notions 
of networks’ spatial boundaries also applies to networks’ temporal boundaries. Realist 
notions of time will have natural boundaries that actors acknowledge and recognize 
(i.e., school weeks and days, class periods, etc.). Nominalist notions of time boundaries 
are defined by the researcher for a variety of theoretical concerns (i.e., development 
focus, period of historical change, etc.). 


⁴ There is a conceptual tension here regarding what a network consists of and how 
different kinds of representations will offer different understandings of network change. 
This is a conceptual issue that runs to the heart of network theory. The general 
structuralist view is that relations form from repeated interactions and their aggregate 
patterns (Nadel 1955; Hinde 1971). In contrast, interactionists argue that relations and 
roles are established by ritual patterns of interaction that have a particular sequence, 
such that reordering the sequence of interactions would undermine the meaning of the 
ritual and relation being enacted (Goffman 1967). Work in social movements and 
collective action suggests that it is not so much the accrual of everyday interactions 
that constitutes a relation but the rare, jarring events which lock in or rewire relational 
patterns (Gould 1995). Finally, work on diffusion treats relations as conduits for the 
transferal of goods, and relational timing acts as a “switch” determining temporally 
“downstream” nodes from “upstream” nodes (Moody 2002; Morris and Kretzschmar 
1997). Regardless of the outlook, a more fine-grained perspective may reveal change 
processes in more substantive detail. From the study of continuous relational events, 
we may learn how ordered dyadic acts can form ritual patterns emblematic of roles 
and relations (enriching the aggregate interaction view to include order), and how rare 
jarring events reverberate through relational structures so as to create network change, 
or how the unfolding of relations creates diffusion opportunities (Emirbayer 1997; 
Emirbayer and Goodwin 1994; McFarland and Bender-deMoll 2003). 


For visualizing continuous network data, we characterize a network 
by a time window that spans a set of relational events (Bender-deMoll 
and McFarland 2002). The relational events that transpire within a 
window of time are aggregated into a network. One implication of the time 
window is that relational events have a residue that extends beyond their 
occurrence, defined by the length of the time window. For example, in 
the classroom data discussed below we have streaming interactions 
specifiable to seconds. Since many interactions take hold of actors’ attention 
for a short time after they occur (at least), we decided to make the time 
window equal to 2.5 minutes, which affords a relatively meaningful graph 
for characterizing the interaction pattern arising in that class (see fig. 1). 
The “time window” defines a right-open interval (from “slice start” to 
“slice end”). Events that fall inside the interval (arcs 2-5, node C) or 
ongoing events that extend past the interval’s bounds (nodes A, B) are 
included in the network. This method is quite flexible as it deals fairly 
well with the aggregation of both “instantaneous” events and those having 
a defined duration. Successive networks are generated by incrementing 
the values for “slice start” and “slice end” points by a constant (delta) 
value. 
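
As a rough illustration of the slicing rule just described, the following Python sketch bins relational events into right-open time windows and advances the window by a constant delta. It is a sketch under stated assumptions, not SoNIA's implementation: the event tuple layout, the function names `in_slice` and `make_slices`, the example events, and the choice to include an event that ends exactly at the slice start are all assumptions made here.

```python
def in_slice(start, end, slice_start, slice_end):
    """True if an event touches the right-open window [slice_start, slice_end).

    Instantaneous events have start == end. Counting an event that ends
    exactly at slice_start as included is an assumption of this sketch.
    """
    return start < slice_end and end >= slice_start


def make_slices(events, t0, t_end, duration, delta):
    """Aggregate events into successive windows of length `duration`.

    `delta` is the offset between successive windows: delta == duration
    gives nonoverlapping windows, delta < duration gives overlapping ones.
    """
    slices = []
    slice_start = t0
    while slice_start < t_end:
        slice_end = slice_start + duration
        members = [e for e in events
                   if in_slice(e[2], e[3], slice_start, slice_end)]
        slices.append((slice_start, slice_end, members))
        slice_start += delta
    return slices


# Hypothetical events: (source, target, start_second, end_second).
events = [
    ("A", "B", 10, 10),    # instantaneous
    ("B", "C", 140, 160),  # ongoing across a window boundary
    ("C", "A", 150, 150),  # falls exactly on a boundary
    ("A", "C", 400, 420),
]

# 150-second (2.5-minute) windows, nonoverlapping.
for slice_start, slice_end, members in make_slices(events, 0, 450, 150, 150):
    print(slice_start, slice_end, [(s, t) for s, t, _, _ in members])
```

Calling `make_slices(events, 0, 450, 150, 75)` instead would produce windows that overlap by half their length, in the manner of a moving average.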

After we define the window of time, the next decision is whether 
successively defined windows are to be overlapping or nonoverlapping.⁵ For 
example, if we aggregate streaming dyadic relations into 2.5 minutes of 
class time for defining the network, we can make the next window consist 
of the next 2.5 minutes (nonoverlapping) or have it partially overlap with 
the current one, as in a moving average (see Doreian 1980, 1986). The 
most incremental relational change consists of a new window that only 
has one tie added to the front of it and another dropped off the back. 

The third set of concerns centers on the timing of arc representation. 
The placement of nodes in a graph is distinct from the animation of a 
particular relation. One can use the network data to generate coordinates 
for nodes and then control the animation of when relations are shown by 
a different procedure. We have found that the timing of arc representation 
helps viewers see the network process more clearly because it clarifies the 
link between relational change and node placement. In essence, the 
animation of arcs follows a second time window, a “render window” that 
“slides” from one network time window to the next. Nodes and arcs are 
drawn and deleted as the render moves over them, and the node 
coordinates are adjusted incrementally between the two layouts (see fig. 2). 

The process of creating a visual transition from one network’s layout 
coordinates to another involves drawing a number of intermediate 
“renders” of the network while gradually adjusting node coordinates and 
adding or deleting nodes and arcs as they fall within the moving render 
window. Successive network windows may overlap, which is controlled 


⁵ In SoNIA, we define the degree of overlap by a delta measure, which indicates how 
much time is added or subtracted from the prior window. 

FIG. 1. Construction of a “network slice” by binning node and arc events 


by the relationship between the duration of the time slice window and 
the offset between successive windows (bin delta). 
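
The gradual adjustment of node coordinates between two layouts can be sketched as a simple interpolation. The code below assumes linear interpolation and a fixed number of intermediate renders; both are assumptions for illustration, since the text says only that coordinates are adjusted incrementally. The function name `interpolate_layouts` and the example coordinates are hypothetical.

```python
def interpolate_layouts(layout_a, layout_b, n_renders):
    """Return n_renders + 1 frames moving nodes from layout_a to layout_b.

    Layouts map node -> (x, y). Only nodes present in both layouts are
    interpolated; nodes that enter or leave would be drawn or deleted
    separately as the render window passes over them (not shown here).
    """
    shared = layout_a.keys() & layout_b.keys()
    frames = []
    for step in range(n_renders + 1):
        frac = step / n_renders
        frame = {}
        for node in shared:
            xa, ya = layout_a[node]
            xb, yb = layout_b[node]
            frame[node] = (xa + frac * (xb - xa), ya + frac * (yb - ya))
        frames.append(frame)
    return frames


# Hypothetical coordinates for two successive network slices.
layout_a = {"A": (0.0, 0.0), "B": (1.0, 1.0)}
layout_b = {"A": (2.0, 4.0), "B": (1.0, 3.0)}

for frame in interpolate_layouts(layout_a, layout_b, 4):
    print(frame["A"], frame["B"])
```

More renders per transition give smoother motion at the cost of a longer movie; an easing curve could replace the linear fraction without changing the rest of the sketch.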

If the network data are continuous, adjusting the size of the render 
window in relation to the network window allows the researcher limited 
control over how many new relations tend to lead or lag node movement. 
For example, A’s interaction with B can lead nodal position such that 
the arc transpires and then the nodes shift position, or we can lag 
interaction such that nodes move closer to one another and then an arc forms 
(as if nodes first draw near each other and then talk). However, the actual 
timing of arc representation is determined by where an arc’s time 
coordinates land relative to those of the bin. 


NETWORK VISUALIZATION 
Graph Layout Principles 


Good reviews of the history of network visualization can be found in the 
literature (Freeman 2000b). The effectiveness of network visualization 
technique differs by network size. Small networks can focus on detailed 
elements of the graph structure while larger networks can mainly capture 
gross topology. Visualizing networks of tens of thousands of nodes requires 
further abstraction yet.⁶ Our interest in this article is identifying layout 
principles that are useful for research in small to moderate sized networks 
(fewer than about 100 people). 

A useful goal for most social network layouts is to represent social 


⁶ See app. B (online only) in this article for a static example and Powell et al. (2005) 
for a dynamic example. 

FIG. 2. Transitioning between slices with a sliding “render window” 


distance as physical distance.⁷ This representation allows viewers to get 
a spatial understanding of social relations, as nodes with many relations 
in common are placed close together on the printed page. An intuitive 
impression of the network structure then emerges from the proximities in 
the image. For example, one can easily see racial segregation in a network 
image as the clustering of races into distinct spaces on the page (Moody 
2001b, p. 683). A social-distance-based representation of network structure 
is facilitated when edge lengths are equated to relational strengths. Various 
force-directed layout techniques are usually successful at this.⁸ 

Within the larger topology, aesthetic features of the graph help facilitate 
readability. All else equal, edge crossing, running edges “under” nodes that 
are not connected to each other, and stacking nodes on top of each other 


⁷ For this article, social distance is defined graph theoretically as the length of the 
shortest path in the network connecting two nodes. However, alternate formulations 
might be interesting and advisable depending on a researcher’s theoretical framework. 


⁸ There is a difficulty here when ties are directed, since the visual distance from i to 
j has to equal the visual distance from j to i, even though social distance need not be 
symmetric. In SoNIA, we use the basic correspondence between screen distance and 
social distance to evaluate layout fit, using a modification of Kruskal’s stress measure, 

\[
\mathrm{Stress} = \frac{\sum_{ij} \left(\mathrm{ScreenDist}_{ij} - \mathrm{GraphDist}_{ij}\right)^{2}}{\sum_{ij} \mathrm{GraphDist}_{ij}}
\]

where ScreenDist is the Euclidean distance of the layout x-y coordinates, and 
GraphDist is the geodesic distance based on an underlying symmetric graph. 
all hinder readability of the graph (Davidson and Harel 1996). While we 
do not explicitly control these features in the movies below, they are key 
elements in our evaluation of the resulting graph layouts. Color, size, and 
shape are useful ways to add additional information on actor or relational 
attributes. We have found that multiplex ties (multiple ties 
of different types) can be effectively represented with colors and 
“transparent” arcs, which allow you to look through the edge to see multiple 
relations on a single dyad as a blend of colors. 


Graph Layout Algorithms 


Force-directed or spring-embedder algorithms are among the most 
common automatic network layout strategies. These algorithms function 
substantively on an analogy, treating the collection of nodes and arcs as a 
system of forces, and the layout as an “equilibrium state” of the system. 
Generally, edges between nodes are represented as an attractive force (a 
“spring” pulling them together), while nodes that do not share a tie are 
pushed apart by some constraint to help prevent overlap. The two most 
common layout algorithms are Kamada-Kawai (KK; Kamada and Kawai 
1989) and Fruchterman-Reingold (FR; Fruchterman and Reingold 1991). 
For FR, the underlying model roughly corresponds to one of electrostatic 
attraction in which the attractive force between connected nodes is 
balanced by a repulsive force between all nodes. For KK, it is as if all nodes 
are connected by springs with a resting length proportional to the 
shortest-path distance between them. For both KK and FR, the relations between 
nodes must be expressed as distances rather than adjacencies. In KK, this 
“dissimilarity” matrix is constructed from geodesic distances between 
nodes. In FR, the dissimilarity matrix is constructed directly from 
adjacencies. 
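
To make the KK input concrete, the sketch below computes geodesic (shortest-path) distances on an undirected, unweighted graph using breadth-first search. It illustrates only the construction of the dissimilarity input, not the layout optimization itself; the function name `geodesic_distances` and the example edge list are hypothetical, and treating every tie as symmetric with length 1 is an assumption of the sketch.

```python
from collections import deque


def geodesic_distances(edges):
    """Return dist[u][v] = shortest-path length, for every reachable pair.

    Edges are treated as undirected and unweighted. Pairs with no
    connecting path are simply absent from the result.
    """
    neighbors = {}
    for a, b in edges:
        neighbors.setdefault(a, set()).add(b)
        neighbors.setdefault(b, set()).add(a)

    dist = {}
    for source in neighbors:
        d = {source: 0}
        queue = deque([source])
        while queue:  # breadth-first search outward from source
            node = queue.popleft()
            for nxt in neighbors[node]:
                if nxt not in d:
                    d[nxt] = d[node] + 1
                    queue.append(nxt)
        dist[source] = d
    return dist


# Hypothetical four-node chain A - B - C - D.
dist = geodesic_distances([("A", "B"), ("B", "C"), ("C", "D")])
print(dist["A"]["D"])  # 3
```

A spring-based layout in the KK style would then try to place each pair of nodes at a screen distance proportional to the corresponding entry of `dist`.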

Both of these algorithms are available in current network drawing 
software, such as Pajek (Batagelj and Mrvar 2001) and NetMiner (Cyram 
2003). The algorithms work by iterative optimization, adjusting a node’s 
position by reacting to the positions of others. As such, the starting position 
of the network affects the outcome. The details of the optimization 
procedure can also affect layout. Because there are no exact calculations of 
global minima, layouts are subject to local convergence problems. 

A second class of algorithms plots nodes using the dimensions that result 
from multidimensional scaling (MDS) techniques based on the geodesic 
distances or some alternate measure of node similarities.⁹ These layouts 
are available in common software packages, such as NetDraw, Krackplot, 
or MultiNet. Substantively, metric MDS models reduce the n - 1 
dimensions present in a network to the two dimensions that capture most of 
the variance in observed multidimensional distance. Because they take 
input distances as their starting point, these models do not, in themselves, 
attempt to stop nodes from stacking on top of each other, and as such 
they can easily generate layouts where structurally equivalent nodes 
occupy the same location. Many authors correct for this by adding a small 
amount of noise or other correction to the resulting structure. One 
theoretical value to the metric MDS model is the direct linkage between 
input relational distances and resulting display. Nonmetric MDS 
algorithms use one of a number of optimization techniques to find a 
“low-stress” (well-balanced relation between input and screen distances) layout. 


⁹ Technically, KK could be considered a variant of nonmetric MDS. 
Because of the necessity of giving results in terms of two-dimensional 
distance, both force-directed and MDS models are symmetric, using either 
a symmetric input distance matrix (MDS) or having symmetric forces 
driving/pulling nodes together in force-directed layouts. While the 
distinction between direction and distance is irrelevant in static layouts, it 
can be substantively important for dynamic layouts, as we often want 
nodes to follow their nominations. To accommodate direction, we 
introduce an algorithm that follows a simple peer influence analogy, building 
on a suggestion in Moody (2001a). A node’s position is a function of their 
prior position based on a self-weight and the average position of those 
they nominate (see fig. 3). 

In figure 3, the thin black arrows indicate nominations, while the wider 
gray arrows indicate net force for each person’s movement. Person 2 
should move down in response to the force of ties sent to persons 1 and 
3, while 3 moves slightly to the right based on the net pull of 4 and 5 
over the pull of 2, and slightly up as a function of the net pull of 2 and 
4 over 5. Similarly, node 6 moves toward the center of nodes 1, 3, and 5, 
based on the strong difference in their positions. It is important to note 
that nodes 1, 4, and 5 should not move at all, since they send no ties 
elsewhere in the network. 

The peer influence algorithm works by computing the weighted average 
of a node’s current x and y positions and that of those that node nominates. 
This process is iterative, adjusting each node’s position to that of others 
multiple times.¹⁰ As with the metric MDS layouts described above, the 
algorithm makes no explicit reference to resulting node positions, which 
can result in nodes stacked on top of each other. The result is often similar 
in flavor to a metric MDS plot, which should not be surprising given their 

¹⁰ In practice, the number of needed iterations is quite small, as the procedure tends 
toward high “consensus” positions. Similarly, high self-weights generate more stable 
movies. 

FIG. 3. An example of the peer influence layout algorithm 


common roots (under a particular set of assumptions) to a general class 
of eigenvalue models (Friedkin 1998). 
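
The peer influence update described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions rather than the authors' code: the function names, the default self-weight of 0.5, the synchronous update (all nodes move at once from the previous positions), and the example coordinates are all choices made here.

```python
def peer_influence_step(positions, nominations, self_weight=0.5):
    """Move each node toward the mean position of the nodes it nominates.

    positions:   node -> (x, y)
    nominations: node -> list of nominated nodes (directed ties sent)
    Nodes that send no ties keep their position.
    """
    new_positions = {}
    for node, (x, y) in positions.items():
        alters = nominations.get(node, [])
        if not alters:
            new_positions[node] = (x, y)
            continue
        mean_x = sum(positions[a][0] for a in alters) / len(alters)
        mean_y = sum(positions[a][1] for a in alters) / len(alters)
        new_positions[node] = (
            self_weight * x + (1 - self_weight) * mean_x,
            self_weight * y + (1 - self_weight) * mean_y,
        )
    return new_positions


def peer_influence_layout(positions, nominations, self_weight=0.5, iterations=5):
    """Apply the update a small number of times."""
    for _ in range(iterations):
        positions = peer_influence_step(positions, nominations, self_weight)
    return positions


# Hypothetical example: node 3 nominates nodes 1 and 2, which send no ties.
positions = {1: (0.0, 0.0), 2: (4.0, 0.0), 3: (2.0, 4.0)}
nominations = {3: [1, 2]}
print(peer_influence_step(positions, nominations))
```

Because nothing in the update keeps nodes apart, nodes that nominate the same alters drift toward the same point, which matches the stacking behavior noted in the text. A higher `self_weight` slows each node's movement per iteration.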


EMERGING NETWORKS 


In many settings, the substantive question is the emergence of a 
cumulative network structure. Relations in such settings are temporally sparse, 
requiring the aggregation of many periods (i.e., a large time window) to 
generate a meaningful image of the network structure. For example, a 
high school romantic network viewed on any given day will consist largely 
of a set of completely disconnected dyads, since only a small number of 
students are involved in multiple romantic relations on any given day. 
Viewed contemporaneously, this network is essentially structureless. 
However, sexually transmitted disease risk resides in the history of previous 
relations, which cumulate and thus provide pathways for disease 
transmission. As such, we are substantively interested in both the cumulative 
structure of the network and the process through which the structure 
unfolds. 

We have found that one of the most effective ways for displaying such 
sparse networks is to show how the network emerges over time, by adding 
nodes and relations as they appear, but placing them in the display plane 
based on the final aggregate structure. The dynamic element then appears 
by revealing the network over time and seeing the pieces grow together. 
As new nodes and relations enter the population, they are added to the 
graph. To effectively distinguish current relations from past relations, we 
“ghost” relations when they end (i.e., fade out). 
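
The flip book logic described above amounts to fixing node coordinates once and then deciding, frame by frame, which relations to draw and which to ghost. The Python sketch below illustrates that classification step only. The relation tuple layout, the function name `flip_book_frame`, and the example data are hypothetical, and actual drawing (for example, rendering ghosted arcs in a lighter shade) is left out.

```python
def flip_book_frame(relations, t):
    """Classify each relation at time t as active or ghosted (ended).

    relations: list of hypothetical tuples (node_a, node_b, start, end).
    Relations that have not yet started are omitted from the frame.
    """
    frame = {"active": [], "ghosted": []}
    for a, b, start, end in relations:
        if start > t:
            continue  # not yet formed, so not drawn
        state = "active" if t <= end else "ghosted"
        frame[state].append((a, b))
    return frame


# Node positions are fixed in advance; in practice they would come from a
# layout of the final aggregate network. These values are illustrative.
positions = {"A": (0.0, 0.0), "B": (1.0, 0.0), "C": (2.0, 1.0)}
relations = [("A", "B", 1963, 1965), ("B", "C", 1970, 1972)]

for year in (1964, 1968, 1971):
    print(year, flip_book_frame(relations, year))
```

Because positions never change, each frame differs from the last only in which arcs are visible, which is what makes this style comparatively simple to implement.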

As an example, consider the graph given in figure 4 below, which is a 
small section of the sociology collaboration network (Moody 20023), 
recording all article collaborations (among this small sample) between 1963 
and 1999. Based only on the image in figure 4, it appears clear that node 
A is at the center of this network. 

However, our impression of this network changes if we examine how 
it develops over time, as can be seen in movie 1. (All movies in this article 
may be viewed in the online copy of AJS.) Here we see that the relations 
connecting the full component only form much later in the unfolding of 
the network. Note also that the pace of relations is evident in the movie, 
as the structure admits to a four-year “dormant” period between 1969 and 
1973 when no changes occur. One’s understanding of the betweenness of 
these center nodes changes once the temporal nature of the network is 
revealed. 

As a second example, consider relations among all people in the largest
connected component of an adolescent romantic network (see Bearman,
Moody, and Stovel [2004] for details of this structure's development;
Moody [2002] for a discussion of dynamic diffusion implications). Figure
6 shows all relations that were active in the 18 months prior to the
interview.

The most striking feature of the static network is a large connected
component, linking 288 students through romantic involvement. When
viewed dynamically, we see that this structure emerges quite late in the
local history of the network and is never observed as a complete structure
at any point in time. Moreover, a close review of the movie reveals that
certain contemporaneous substructures never occur, namely, four-cycles.
Prior work suggests that avoiding such four-cycles helps generate the
sparse treelike structure of the network (Bearman et al. 2004). In general,
this flip book approach is among the simplest network movies to implement.
The technique will likely be most effective when one is interested
in the cumulative graph structure, or when one can meaningfully fix node
position in an x-y space.

Fig. 4.—Sociology coauthor selection; nodes colored by sex, size proportional to degree


DYNAMIC NETWORK MOVIES 


Perhaps the theoretically most promising advance for dynamic network
visualization lies in using node movement to map changes in the underlying
network structure. Substantively, we are often interested in endogenous
network processes. Examples include identifying how conversations
build in classrooms to transform an ordered teaching environment into a
cacophonous menagerie, the development of powerful positions through
business exchange, or the endogenous emergence of social structure from
actor-based interaction rules. In all of these cases, we suspect that the
structure at time t influences the structure at time t + 1 in systematic
ways, and being able to visualize the transformation of the structure can
help identify the mechanisms through which such changes occur (McFarland
and Bender-deMoll 2003).


Requirements for Meaningful Dynamic Layouts 


All of the aesthetic requirements for static graphs apply to dynamic
graphs. In addition, however, a number of factors are unique to dynamic
MOVIE 1.—Online


representations that need to be addressed. An animation or interpolation
technique is needed to create meaningful movement between temporally
adjacent network slices. Most useful for this is a sinusoidal animation
technique that gradually interpolates the position of a node from one
resting position to the next. This interpolation is what helps the eye follow
changes in the graph structure over time.
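
One way to realize such an interpolation is a cosine easing curve, sketched
below. The exact easing function is not specified in the text, so the
formula s = (1 - cos(pi * f)) / 2 and the names ease and interpolate are
assumptions made for illustration. The curve starts and ends with zero
velocity, so nodes accelerate gently away from one resting position and
settle gently into the next.

```python
import math
from typing import List, Tuple

Point = Tuple[float, float]


def ease(f: float) -> float:
    """Map a frame fraction f in [0, 1] to a smooth progress value in [0, 1]."""
    return (1.0 - math.cos(math.pi * f)) / 2.0


def interpolate(p0: Point, p1: Point, steps: int) -> List[Point]:
    """Return steps + 1 positions moving a node from p0 to p1."""
    frames = []
    for k in range(steps + 1):
        s = ease(k / steps)
        frames.append((p0[0] + s * (p1[0] - p0[0]),
                       p0[1] + s * (p1[1] - p0[1])))
    return frames


# A node moving 10 units along the x axis over four in-between steps.
path = interpolate((0.0, 0.0), (10.0, 0.0), 4)
```

Compared with linear interpolation, the node covers less ground in the first
and last steps and more in the middle, which makes starts and stops easier
to follow.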

Given the ability to animate change in the network structure, one must
then ensure that the layout at time t + 1 is linked to the layout at time t
to avoid meaningless movement in the graph. While the task is simple on its
face, applying standard layout algorithms separately to each time slice
will rarely give a satisfactory result. Because network layouts usually
have no inherent coordinate axes, the entire graph tends to “rotate” and
“flip” in the display space. A partial solution to the problem of spurious
movement rests in developing an adequate starting position or “anchor”
for the network that results in a meaningful orientation for the graph.
The anchor choice is not theoretically neutral, as it will affect the resulting
layout. Below we identify multiple anchor possibilities and discuss the
implications of each.

In addition to providing meaningful movement linked to relational
change, a number of additional features of nodes and relations should be
temporally variable as well. A successful layout should be able to
accommodate changes in relational strength, type, and valence (expressed as
edge attributes), as well as changes in node attributes (expressed as shape,
color, and size of nodes), so that temporal changes in attributes will also
be apparent. Finally, while static graphs are usually agnostic with respect
to relational direction, in a dynamic setting we might want to distinguish
movement based on tie direction.


Anchoring Temporal Networks


The first layout possibility is really not an anchor but the use of random
starting positions, which we term “random anchoring.” Points are initially
distributed randomly throughout the x-y plane, and the algorithm uses
these initial starting positions. In general, a random anchoring is effective
only for determining the initial (time 1) layout, and using a new random
set of starting coordinates at each observed time point usually results in
meaningless motion. However, one can test (qualitatively at least) the
effect of starting position on eventual layout by running the same figure
multiple times with new starting values.

Alternatively, one can use a constant fixed position as the starting
anchor for all times, resulting in a “fixed anchor.” The most common such
anchor might be a simple circle. A second common starting point would
be based on some function of the overall graph structure. For example,
one might build a cumulative graph as the sum of all interaction over
the observation period and generate initial layout positions from this
aggregate graph. Or, one might use a meaningful nonrelational distance
metric for the starting conditions, such as the latitude and longitude of
cities in a trade network or the seating chart in a classroom network.
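
The cumulative graph mentioned above is straightforward to build. The sketch
below assumes each time slice is stored as a dictionary mapping a directed
pair (sender, receiver) to a tie weight; that representation and the name
cumulative_graph are illustrative assumptions. The summed weights would then
be passed to whatever layout routine supplies the fixed starting
coordinates.

```python
from collections import Counter
from typing import Dict, List, Tuple

Arc = Tuple[str, str]


def cumulative_graph(slices: List[Dict[Arc, float]]) -> Dict[Arc, float]:
    """Sum tie weights over all time slices into one aggregate graph."""
    total: Counter = Counter()
    for weights in slices:
        for arc, w in weights.items():
            total[arc] += w
    return dict(total)


# Two toy slices (invented for illustration).
slices = [{("a", "b"): 1.0}, {("a", "b"): 0.5, ("b", "c"): 1.0}]
aggregate = cumulative_graph(slices)
```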

In any such fixed anchor case, movement from time 1 to time 2 reflects
differences in the structure of relations between time 1 and time 2, since
starting positions are “held constant” across graph observations, which
greatly reduces superfluous movement. Substantively, using the circle as
a fixed start point can result in systematic distortion in the overall display
fit (if, e.g., nodes that are often connected to each other are placed on
opposite sides of the starting circle), though movement will still be
consistently related to graph changes. Using an aggregate graph layout
position can sometimes lend a bit more consistency, as each observation
window starts “near” the bulk of the structure observed for the whole
period. In practice, we have found these types of starting values most
effective for the metric MDS and peer influence algorithms.

Finally, an obvious anchor for a network layout at time t is the graph
layout at time t - 1. We term this a “chaining anchor,” and it fits well,
substantively, with our intuitive notion of a network movie, as we are
literally plotting the change in node position from time t to time t + 1.
Substantively, this model uses only the information embedded in prior
positions for graph layout. As we show below, the chaining anchor seems
to work best with force-directed layout algorithms, particularly when
network change is small, as starting from t - 1 coordinates helps the
algorithm find an optimum at t that is geometrically close. But when used
with the metric MDS or peer influence algorithms, we often see a network
“inertia” effect, where nodes quickly converge on a very small portion of
the total display space, resulting in a largely unreadable movie.
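
The three anchoring strategies differ only in where each time slice's layout
starts. The sketch below makes that explicit. It is a schematic
illustration, not the software used for the movies: layout_movie,
circle_anchor, random_anchor, and toy_layout are hypothetical names, and
toy_layout (which simply pulls tied nodes a little closer together) stands
in for a real algorithm such as KK or FR.

```python
import math
import random
from typing import Callable, Dict, List, Tuple

Pos = Dict[str, Tuple[float, float]]
Arcs = List[Tuple[str, str]]


def circle_anchor(nodes: List[str], radius: float = 1.0) -> Pos:
    """Fixed anchor: place nodes evenly around a circle."""
    n = len(nodes)
    return {v: (radius * math.cos(2 * math.pi * i / n),
                radius * math.sin(2 * math.pi * i / n))
            for i, v in enumerate(nodes)}


def random_anchor(nodes: List[str], rng: random.Random) -> Pos:
    """Random anchor: scatter nodes over the unit square."""
    return {v: (rng.random(), rng.random()) for v in nodes}


def toy_layout(arcs: Arcs, start: Pos, step: float = 0.25) -> Pos:
    """Stand-in layout: move each tied pair slightly toward each other."""
    pos = dict(start)
    for u, v in arcs:
        (ux, uy), (vx, vy) = pos[u], pos[v]
        pos[u] = (ux + step * (vx - ux), uy + step * (vy - uy))
        pos[v] = (vx + step * (ux - vx), vy + step * (uy - vy))
    return pos


def layout_movie(slices: List[Arcs], nodes: List[str],
                 layout: Callable[[Arcs, Pos], Pos],
                 anchor: str = "chain", seed: int = 0) -> List[Pos]:
    """Lay out each slice, choosing start positions by anchor type."""
    rng = random.Random(seed)
    fixed = circle_anchor(nodes)
    prev = random_anchor(nodes, rng)  # time-1 start for the chaining anchor
    movie = []
    for arcs in slices:
        if anchor == "random":
            start = random_anchor(nodes, rng)  # new random start each slice
        elif anchor == "fixed":
            start = fixed                      # same start for every slice
        else:
            start = prev                       # chain: start from t - 1
        prev = layout(arcs, dict(start))
        movie.append(prev)
    return movie
```

With a fixed anchor, two identical slices yield identical layouts, so any
motion reflects relational change. With a chaining anchor, nodes whose
neighborhoods are untouched stay where they were in the previous frame.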


MOVIE EXEMPLARS 
Data 


In the examples that follow, we use data from three sources chosen to
reflect a wide range of potential applications, moving from theoretical a
priori simulations, to a well-known classical dynamic network, and finally
to complex, multiple-relation, streaming interaction data within
classrooms. Below we first describe the three data sources and then describe
the dynamic movies for each.


Simulating Social Balance 


Social balance theory encapsulates the folk notion that “a friend of a
friend is a friend,” suggesting that people avoid friendships where their
friends are not friends with each other and form friendships when others
relink the pair. Primarily a theory of relational change, social balance
theory is a clear candidate for exploration with dynamic tools. The
essential prediction of much of the work on balance theory is that through
a relational adjustment process, ordered social structure can emerge
entirely endogenously (Davis 1963, 1970; Davis and Leinhardt 1972; Doreian
et al. 1996; Johnsen 1985). Part of the power of this model rests in the
understanding that global implications follow from local relational action.

The simulation starts with a simple random network of 45 actors who
each nominate (on average) four other people. At each of 200 iterations,
five randomly chosen nodes evaluate their local network with respect to
transitivity, intransitivity, and reciprocity; nominations are changed if
doing so increases the comfort of the actors’ overall position with respect
to these characteristics. Actors favor relations that are transitive, avoid
those that are intransitive, seek to reciprocate nominations, and avoid
making long-term asymmetric nominations (Gould 2002). In addition,
actors have a small preference for current ties over changing ties and do
not have perfect vision, but instead evaluate the returns to changes in
their local network with a small amount of random error.¹¹ This model
is useful for demonstrating the endogenous emergence of order from
randomness and demonstrates how seemingly stable summary statistics on
one network dimension can mask significant structural change on other
dimensions, highlighting the holistic-view payoff to this technique. For
the purposes of the movie, we sample the network at every other iteration,
resulting in 100 discrete images of the network.


Newcomb Fraternity 


Newcomb’s fraternity is among the best-known dynamic data sets in
common usage. We use a version of these data that comes standard with
UCINET (Borgatti et al. 1999). The data consist of each student
providing friendship rankings for every other student in the fraternity.
Following Doreian et al. (1996), we recode the original rank data so that the
top four positive ties are retained as “friends,” but use the original rank
data for summary transitivity measures. These data provide a real-world
complement to questions of social balance raised in the simulation above.
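
The recoding step can be sketched as follows. The sketch assumes the rank
data are held as a nested dictionary in which ranks[i][j] is the rank that
student i gives student j, with 1 as the most preferred; the data structure
and the function name top_k_friends are illustrative assumptions rather
than a description of the UCINET file format.

```python
from typing import Dict, Set, Tuple


def top_k_friends(ranks: Dict[str, Dict[str, int]],
                  k: int = 4) -> Set[Tuple[str, str]]:
    """Keep each student's k best-ranked others as directed 'friend' arcs."""
    arcs = set()
    for i, row in ranks.items():
        # Sort the other students by the rank i gives them (1 = best).
        best = sorted(row, key=row.get)[:k]
        arcs.update((i, j) for j in best)
    return arcs


# Toy ranking for two students over three others (invented), keeping top 2.
ranks = {"a": {"b": 1, "c": 3, "d": 2},
         "b": {"a": 2, "c": 1, "d": 3}}
friends = top_k_friends(ranks, k=2)
```

The full rank vectors remain available for the summary transitivity
measures; only the displayed ties are dichotomized.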


McFarland Classrooms 


A third data set stems from McFarland’s repeated observations of social
interactions in over 150 high school classrooms during the 1996-97 school
year. We show dynamic network representations of social interaction from
two of these classes below. The first class (class 173) is an accelerated
trigonometry class at a magnet high school. It is composed of tenth graders
who are tightly controlled by an authoritarian male teacher. The second
class (class 182) is an honors algebra 2 class at the same magnet high
school. It is composed of mostly tenth graders (light gray), but also a few
eleventh graders (dark gray), and it is taught by a progressive but
bumbling male teacher (McFarland 2001).

The data on classroom interactions consist of streaming observations
of conversation turns. The conversation turns were recorded as pairs of
senders and receivers and for types of content. Speakers were viewed as
directing their communication in one of two fashions: (1) indirect
soundings, such as lectures (where a teacher addresses all students), and
(2) direct interactions that are focused on particular others. Each type of
directional speech is viewed as having different forms of network
reception: indirect speech reaches all bystanders as passive hearers, and
directed speech reaches the focal person as an active coauthor of the
conversation (Goffman 1981).


11 The simulation is implemented in SAS IML (interactive matrix language) and is
available from the first author.

Two types of interaction are found to prevail in classroom contexts:
task and sociable (McFarland 2003). Task interactions are those behaviors
that pertain to the ongoing teacher-prescribed task (content is academic).
In contrast, sociable interactions concern everyday concerns of
adolescents' social lives, such as parties, dating, social outings, plans,
and so on. While the content is the key distinction, it is often the case
that these speech acts are distinguishable in style as well, where sociable
behaviors are more playlike, fast paced, and free than the more constrained
academic styles of speech during lessons (Cazden 1988). Last, observations
also recorded when task and sociable forms of interaction were laminated with
evaluative meaning. Such evaluations were seen as being either positive
or negative, either giving praise or attempting a reprimand (Ridgeway
and Johnson 1990).¹²


RESULTS 
Example 1: Dynamic Order from Randomness


Our first example is of the simulated balance process. Figure 5 presents
summary statistics over time, including the reciprocity and transitivity
rates and the proportion of arcs that change from iteration to iteration.
Balance theory predicts that people should adjust their relations until
they reach largely consonant friendship groups, at which point the
network should stabilize around a pattern where everyone's ties are (largely)
transitive. The simulation summary evident in figure 5 suggests this is
essentially the case, as transitivity rises steadily for the first 100 iterations
or so, then largely stabilizes after that. Reciprocity quickly rises early in
the simulation, then falls briefly (around iteration 20), then continues to

12 This coding method was piloted in two prior studies by McFarland (2001) and had
a high degree of accuracy in more teacher-centered classroom lessons due to the
turn-taking sequential nature of discourse. In more chaotic classrooms, simultaneous
turns at talk often prevented the observer from acquiring perfect accuracy. However,
some record was made of when accuracy was diminished. The two class
periods used in this paper were considered to have a high degree of accuracy in their
coding. For the movies included below, we use 2.5-minute moving windows that
overlap in incremental shifts of 0.5 minutes. We believe this method best captures the
fluidity of interaction patterns but diminishes the amount of artificial fluctuation across
frames so that some continuity is had.

Fig. 5.—Reciprocity and transitivity in a dynamic balance simulation


rise steadily throughout the simulation.¹³ Movie 3 presents a dynamic
visualization of this simulation. In this figure, blue ties represent
asymmetric nominations and green ties represent symmetric relations.
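
Summary statistics of the kind reported in figure 5 can be computed directly
from the set of directed arcs at each iteration. The text does not give
formulas, so the definitions below are assumptions: reciprocity is the share
of arcs that are returned, transitivity is the share of two-step paths
i -> j -> k (with i different from k) that are closed by an arc i -> k, and
arc change is the share of arcs present at either iteration that are not
present at both. Self-nominations are assumed absent.

```python
from typing import Dict, Iterable, Set, Tuple

Arc = Tuple[int, int]


def reciprocity(arcs: Iterable[Arc]) -> float:
    """Share of arcs (u, v) for which (v, u) is also present."""
    arcs = set(arcs)
    if not arcs:
        return 0.0
    return sum((v, u) in arcs for u, v in arcs) / len(arcs)


def transitivity(arcs: Iterable[Arc]) -> float:
    """Share of two-paths i -> j -> k (i != k) closed by an arc i -> k."""
    arcs = set(arcs)
    out: Dict[int, Set[int]] = {}
    for u, v in arcs:
        out.setdefault(u, set()).add(v)
    paths = closed = 0
    for i, js in out.items():
        for j in js:
            for k in out.get(j, ()):
                if k != i:
                    paths += 1
                    closed += (i, k) in arcs
    return closed / paths if paths else 0.0


def arc_change(prev: Iterable[Arc], curr: Iterable[Arc]) -> float:
    """Share of arcs in either network that differ between the two."""
    prev, curr = set(prev), set(curr)
    union = prev | curr
    return len(prev ^ curr) / len(union) if union else 0.0


# Toy network (invented): 1 and 2 nominate each other, and both nominate 3.
net = {(1, 2), (2, 1), (2, 3), (1, 3)}
```

Applying the three functions to each sampled iteration yields the time
series that a figure of this kind would plot.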

The movie shows the emergence of structure out of randomness. The
initial stages of the movie show very little structure. Instead, actors are
simply “milling around” as they adjust relations in response to the initial
random tie allocation. After about 50 iterations, two patterns emerge.
First, a set of initial “protogroups” forms around four-person cliques.
These small protogroups form the structural kernels that future relations
orient around as time progresses. Second, a small number of “stars”
(people receiving a relatively large number of asymmetric ties) become
evident. Star status proves somewhat unstable, however, as the search
for reciprocity means that nodes only nominate stars for a short time.
However, as a counterforce, the push for transitivity means that connected
pairs will nominate thirds in concert, resulting in a continuous existence
of stars in the setting. About halfway through the simulation, it becomes


13 The network movie provides a nice explanation for this pattern. The drop in
reciprocity seems to occur because the initial rise in reciprocity leads to a large number
of intransitive triads. As the number of intransitive triads increases, the
transitivity-seeking pressure seems to overpower reciprocity seeking, and adjustments are made
to build transitive ties, and then within the transitive sets, reciprocity increases. This
pattern is consistent with theoretical and empirical expectations on real networks
(Sorensen and Hallinan 1976).

Movie 3.—Online


clear that the network is no longer a purely random collection of nodes,
but instead a loose federation of groups weakly connected through
asymmetric ties. At this point, the group-group structure becomes interesting,
as high-degree actors change asymmetric ties between groups, changing
the relative links of groups to the remaining network. Here we see the
component structure change, as groups break away from each other into
smaller collections.

Throughout the remainder of the movie, the structure never crystallizes,
but instead “bubbles” as nodes form links between different groups. Nodes
make nominations to others in the setting that are then built on by others
in response to the changing conditions. The resulting temporal network
story, then, becomes one of changes building on change, and a
group-level dance that emerges as some group members form ties to others. That
some of these changes are, surely, the result of the random component
built into the simulation is less interesting than the fact that such random
events can have systematic effects in the structuring of others’ responses.

While not intended to model any particular real setting, the image one
gets from the resulting movie might well mirror the development of
conversation groups in a cocktail party, as sets of people mingle and
eventually find those they wish to speak with on a more engaging basis. The
simulation also fits well with recent work on the systematic effects of a
small number of random ties (Newman 2000; Watts and Strogatz 1998).

Technically, we consider this a successful visualization because almost
all of the movement evident in the graph results directly from interpretable
changes in underlying network structure. The visualization used the KK
layout algorithm and a chain-based anchor. (For alternative movies using
different layout algorithms, see app. B.)


Example 2: Multiple Visions of Newcomb’s Fraternity


The substantive story usually told about Newcomb’s fraternity is one of
structural convergence as a group of new college students meet and form
friendships (Doreian et al. 1996; Newcomb 1961; White et al. 1976). This
description of the data rests on evaluations of summary measures (or
aggregate blockmodels) of the network. However, the “convergence” story
of the Newcomb data is not uncontroversial, since up to 20% of the ties
change even in the last few weeks of observation. Finding local fluidity
and global stability (as measured in blocks or summary statistics) suggests
that change moves through structurally equivalent actors. Figure 6 plots
the reciprocity and transitivity rate over the 15 observations.

Figure 6 plots the change in reciprocity and transitivity, using the coding
scheme described in Doreian et al. (1996). The figure suggests, and many
have interpreted these data as showing, that the network reached a
particular level of reciprocity early, while transitivity increased throughout
the 15-week observation period. However, when combined with the arc
change information, one could argue that reciprocity never converges in
any meaningful sense but instead fluctuates substantially over the entire
observation period. Similarly, transitivity increases steadily, and the
change is more than one would expect by chance (Doreian et al. 1996,
p. 124), but the degree of change is not dramatic (moving from 0.75 to
0.80).¹⁴ How does our image of the Newcomb data differ when we view
changes in the entire network simultaneously? In the movies that follow,
reciprocated nominations are colored green and given a width equal to
the average of the two arcs. Asymmetric ties are blue, with width equal
to the strength of the nomination.

Movie 4 presents a KK algorithmic chain-anchor layout version of the
Newcomb fraternity network, matching the version used in the balance
simulation above. The first impression of this movie is that the network
seems to change a great deal over time. This is partially due to the fact
that, unlike the simulation example, the Newcomb data were collected
in discrete waves, so there are large numbers of edge changes between
subsequent networks. Close examination suggests that many nodes are
moving even when their relational ties do not change (though the ties of
the people they are tied to change, and thus the whole system should
change shape). Some of this movement, however, seems excessive and
hard to follow. Consider instead movie 5, which uses the FR layout
algorithm.

Movement in movie 5 is more subdued, and the pattern of interaction
is somewhat easier to follow. For example, one can see that nodes 1, 6,


14 The transitivity measure used here is based on a sorted rank model.

8, and 13 remain strongly connected to each other throughout the
observation period, occupying a small cluster at the right of the graph. Nodes
7, 12, and 4 form a cluster early in the group's history, but node 4 then
breaks with this group at about week 8, instead nominating nodes 17 and
2. In general, there is still a great deal of movement in this graph, and
the large number of asymmetric ties suggests that we might gain some
insight by using a layout method that accounts for this asymmetry.

Movie 6 uses the peer-influence layout algorithm, with a cumulative
graph anchor (the starting position is based on a KK layout of the average
positive tie value between every pair cumulated over the entire
observation period). Here we immediately see a quick break between those
embedded in largely symmetric relations and those hanging on to the edge
of the structure. Nodes 10 and 15, for example, quickly emerge as nodes
on the edge of the social structure. While they nominate each other
symmetrically early in the observation period, they lose interest in each other
by the end. Neither node receives top-five nominations from any other
node in the network. Their nominations to others seem to dance around
the graph, never resting for long on a single person. The group-structure
dynamics also become clearer, as node 17 seems to be a popular actor
bridging the cluster formed around nodes 1, 6, 8, and 13 and a smaller
(and less stable) cluster surrounding nodes 7 and 4.

Hence, the movies suggest that the overall structure does not converge
on a single form, and that the process of change is heterogeneous, with
some actors forming stable relations while others dance between friends
throughout the observation period. This insight suggests disaggregating
the network change statistics evident in figure 6, which we do in figure
7.

Figure 7 presents the results of a cluster analysis on the sequence of
network change scores (calculated as the correlation between each
person's time t and time t - 1 nomination vector, to retain the full range of
rank data). A three-cluster solution is instructive. Two groups follow a
simple convergence story, with their nominations getting progressively
more stable as time passes. The first of these groups (group 1 in fig. 7)
has seven members, including the cluster at the right of the movie (1, 6,
13, 8), and presents a gradual convergence of nomination patterns, while
the second (group 2, with six members) does not converge on stable
nomination patterns until week 5. Finally, group 3 (with four members,
including the hanger-on nodes 10 and 15) never seems to settle on a
particular nomination pattern, but changes nominations steadily over the
observation period.
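
The change scores that feed the cluster analysis can be sketched as below.
The sketch assumes each wave is a dictionary mapping a person to the vector
of ranks that person assigns to the others, and it uses an ordinary Pearson
correlation; the names pearson and change_scores are hypothetical, and the
clustering step itself is not shown.

```python
import math
from typing import Dict, List, Sequence


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation between two equal-length vectors."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def change_scores(waves: List[Dict[str, List[float]]],
                  person: str) -> List[float]:
    """Correlate a person's nomination vector at each wave t with wave t - 1."""
    return [pearson(waves[t][person], waves[t - 1][person])
            for t in range(1, len(waves))]


# Toy rank vectors for one person over three waves (invented).
waves = [{"a": [1, 2, 3, 4]}, {"a": [1, 2, 3, 4]}, {"a": [4, 3, 2, 1]}]
scores = change_scores(waves, "a")
```

A score near 1 means the person's rankings barely moved between waves; lower
scores indicate more reshuffling. Each person's sequence of scores is the
profile on which a cluster analysis of this kind would operate.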

The substantive advantage of the dynamic movie, in this case, is to
identify a level of internal heterogeneity to the network evolution that
had never been noticed before, given the focus on global network
summary statistics. This type of exploratory interaction between visualization
and analysis is one of the best reasons to push progress on dynamic
network visualizations.


Example 3: Ritual and Rebellion in the Classroom


Our final example presents a network representation of the streaming
interaction data in two classrooms observed by McFarland. Each
interaction observed is represented by a directed arc. Statements that were
directed “to all” appear as a “star” of links. Direct interactions were
weighted as 1 (thick lines), and indirect interactions were weighted as
1/n (where n = class size). Ties are coded as “social,” “task,” and “sanction”
to control the red, green, and blue color values of the ties. “Task”
interactions appear black, “social” interactions as blue, “praise” as green, and
“sanction” as red. The networks are shown in slices of 2.5 minutes'
duration, so it is possible that multiple ties will exist between a given pair.
In all figures, the teacher is represented as a yellow node. Gender is coded
in the shape of the nodes (square = male, circle = female) and age in
color (tenth graders are light gray, eleventh graders are dark gray).
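
The slicing and weighting scheme can be sketched as follows. The record
layout (minute, sender, receiver), the use of None to mark a statement
directed to all, and the name sliding_windows are assumptions made for
illustration. Windows are taken here to start at multiples of the shift and
to be closed on the left and open on the right; the text does not say how
window boundaries were handled.

```python
from collections import defaultdict
from typing import Dict, List, Optional, Tuple

Turn = Tuple[float, str, Optional[str]]  # (minute, sender, receiver or None)


def sliding_windows(turns: List[Turn], members: List[str],
                    width: float = 2.5, shift: float = 0.5
                    ) -> List[Tuple[float, Dict[Tuple[str, str], float]]]:
    """Aggregate conversation turns into overlapping weighted networks."""
    n = len(members)
    last = max(t for t, _, _ in turns)
    frames = []
    k = 0
    while k * shift <= last:
        start = k * shift
        weights: Dict[Tuple[str, str], float] = defaultdict(float)
        for t, sender, receiver in turns:
            if not (start <= t < start + width):
                continue
            if receiver is None:
                # Broadcast: a star of weak ties, each weighted 1/n.
                for other in members:
                    if other != sender:
                        weights[(sender, other)] += 1.0 / n
            else:
                weights[(sender, receiver)] += 1.0  # direct interaction
        frames.append((start, dict(weights)))
        k += 1
    return frames


# Toy class of a teacher "T" and three students (invented for illustration).
members = ["T", "s1", "s2", "s3"]
turns = [(0.2, "T", None), (1.0, "s1", "T"), (3.0, "s2", "s1")]
frames = sliding_windows(turns, members)
```

Because adjacent windows share most of their turns, consecutive frames
differ only slightly, which is what keeps the resulting movie continuous.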


Class 173: Development of an Interaction Ritual


This example presents an orderly classroom, and movie 7 demonstrates
this order. To start, figure 8 presents the summary graph scores for class
173 comparable to those seen in the figures above. The bottom axis reveals
half-minute increments of class time so that a value of 70 indicates the
thirty-fifth minute of a class period. Viewing the figure, we see an
immediate difference with the previous friendship networks: transitivity
seems much less important in the conversation networks examined here.
On the other hand, reciprocity tends to be quite strong throughout the
observation period, though it admits to a high variance over time. Note
that none of the summary graph statistics converge toward a particular
value, suggesting that the interesting story in this setting deals with the
shape of change in the network rather than with the shape of the resultant
structure. Relational change remains relatively constant until 35 minutes
into the class period (see unit 70), when the level of interaction drops
significantly.

The movie for this class shows the transition across two primary activity
structures: recitation and group quizzes. Each type of segment calls upon
students to organize their behaviors in different sequential and relational
patterns. In the opening phase of the class, the teacher does some
maintenance, collecting papers and making announcements as students slowly
discard their sociable routines (minutes 0-2). Then the teacher settles the
class into a recitation routine concerning homework problems (minutes
3-28). Once the recitation routine is established, a clear ritual structure
emerges where the teacher broadcasts questions to the class, and
individual students sequentially reply to those questions as they move to the
center of the network image (Mehan 1979; Cazden 1988). At around
minute 26 (see unit 52) the teacher makes a joke (sociable broadcast) and
describes a humorous problem for the students to solve. Here, the task
retains the same structure of recitation but has a “blip” of bracketed social
activity. In the final phase of the class, there is a clear shift to dyads and
triads. Here the teacher asks students to get into assigned pairs and triads
to work quietly on their quiz. This final segment shows lower rates of
behavior that arise in an intermittent, localized nature. Near the end of
class, the teacher makes a final announcement, and then the students
break off and leave.

This movie illustrates how interaction dynamics can be represented in
a network form that captures both the sequential and relational nature
of ritual interaction. Moreover, it illustrates how the coordination and
mobilization of students through these routines is accomplished (i.e.,
through some sanctions and sociable activity, in this case). What we
observe are two distinct kinds of relational “dances” and clear switches across
them.


Class 182: Ignoring Authority


Moving from order to disorder, we now turn to class 182, with the
now-familiar summary scores presented in figure 9. As with the previous
classroom, we find no general pattern toward convergence in these values.
The amount of change and transitivity are generally higher in this
classroom. As will become clear, the higher transitivity levels follow from the
generally higher level of sociable activities.

Movie 8 shows the students in honors algebra 2 move through
essentially two phases of classroom activity. In the first phase, the teacher
lectures on and then invites discussion on student test performance
(minutes 0-15). The students ignore and deride this effort, so we see a
great deal of sociable interaction and conflict. About halfway through this
shortened class period (to make time for an assembly), the teacher shifts
into discussing and modeling homework problems with calculators. This
shift offers little change in form, as no one really listens and the sociable
routines continue. In the final 10 minutes of class, the teacher allows
students to work alone in their groups, and here their sociable interactions
can persist unabated (minutes 16-35). Throughout the class period,
students stay involved in their social affairs, and the teacher is forced to
interrupt them, but still fails to secure their attention.

The point of this movie is to show what disorder and lack of control
look like in dynamic form. While the movie shows that different patterns
of relations arise with each shift in activity, it does not show as clear a
shift as observed in the more controlled class above. This makes sense
since the class never really pays attention to the teacher's prescribed
scripts for behavior. What we see is a high degree of social cliquing among
students that never abates in spite of the teacher's sanctioning efforts. It is
in the later phases of unabated social activity that we see the most stability
in network form.

In comparison to the other movies of discrete time, the movies of
classroom interaction and continuous time seem to be qualitatively different.
This may be more an artifact of the examples and slices of time we
chose than of the methods. The simulation on balance and Newcomb’s
acquaintance story both show a progression in affective ties toward some
end state and use discrete notions of time. In classroom interactions,
the ties are represented in minutes of continuous class time, and since the
movie is specific to a single class period, there really is no story of
progressive equilibrium in affective ties. Instead, the aim is to mobilize and
coordinate collective action in various formats (lecture, recitation, group
work, student presentations, etc.) and types of ties (task, social, etc.).
Hence, states of equilibrium are specific to tasks like lecture, recitation,
and group quizzes. While the shifts in rates of network change somewhat
align with changing ritual patterns, it is the movies that offer the most
meaningful evidence of change in relational sequence and form.

By using longer time slices (of, say, class periods) and relating them over
the course of the school year, we may acquire something akin to the pattern
of tie formation in the Newcomb study or the balance simulation. A stable
pattern of behavior may eventually form in a classroom and act like a
central tendency (norm) adopted across tasks. The stability of this pattern
may even depend on how close ties reach an equilibrium state of balance
and transitivity. Future work will explore this empirical question further.

SUMMARY AND CONCLUSION


This article has presented a first step toward developing a new class of
dynamic network visualizations. As with most methodological developments
in social networks, the use and development of dynamic network
movies is grounded in the substantive need present in empirical research.
To build intuition about network dynamics effectively, a fine-grained tool
for visualizing relational change is necessary. We have focused on two
types of dynamic network visualizations: network flip books and dynamic
movies. The flip books are a combination of fixed node layout and dynamic
social relations, where nodes remain in a constant position and
arcs fill in the holes among these nodes. They are particularly effective
at showing how sparse relational structures emerge from temporally
disaggregated social interaction. In contrast, fully dynamic network movies
allow nodes to move as a function of relational change. We have examined
three substantive examples, and in each case the movie makes it possible
to observe directly a relational feature that would be lost if we focused
only on summary statistics of the complete network.

The theoretical promise of network visualization rests in helping scientists
see their data. We hope that this type of visualization will spark
theoretical development as people are able to engage their data in new
ways. Abbott (2000) makes a similar point when he argues, “Important
general theory always grows out of extensive empirical work; every great
sociological theorist has been a datahead” (p. 299). Social network movies
allow a limited form of data abstraction and exploration. We are neither
embedded directly in the relations themselves (which we could never do
due to the constraints of time and place) nor forced to limit our observations
to one-dimensional summary statistics that filter out much of the
interesting temporal and relational variation in the data. Instead, like
much recent work in geography and meteorology, dynamic maps provide
a combined synthesis of information, allowing one to view the relevant
abstract features of a given interaction system.

Technically, we have described the use and performance of a number
of particular layout algorithms for building movies. More detail on
constructing such movies can be found in the software description given in
appendix A (Bender-deMoll and McFarland 2002). In many ways these
algorithms are somewhat crude, so additional and ongoing research is
necessary. While we think it is much too early in the development of this
form of data visualization to make strong claims for a superior method,
a few summary statements of our experience are in order. In general,
layout methods should seek to minimize movement such that any change
can be directly related to a particular relational event. As with static
graph visualization procedures, simplicity is the goal. While judicious use
of colors, shapes, and sizes can help, too much of any of these elements
leads to a confusing visual cacophony.

With respect to particular graph layout algorithms, we have invested
the most time and had the most general success with the KK layout
algorithm. It is best at preventing node stacking, making it possible to
disentangle relational patterns at any given moment in the movie. KK
suffers, however, from superfluous local movement generated by changes
in the behaviors of others. Because KK pushes away from all nodes
simultaneously, and due to details of its optimization procedure, nodes tend
to “float” around a space, filling in vacancies left by nodes that are drawn
into new spaces. This is evident in the Newcomb fraternity movie and
can be seen as well in the movement of students around the teacher in
our classroom movies.
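
The “floating” described above is easier to see from the form of the spring energy that a Kamada-Kawai layout minimizes: every pair of nodes contributes a term, so a change in one tie can shift the best position of nodes whose own ties did not change. The following is a minimal sketch of that energy, assuming the standard textbook formulation (spring strength K/d² and rest length L·d for graph distance d). The function name `kk_energy` and the constants `K` and `L` are illustrative and are not taken from SoNIA, whose optimization details are not given here.

```python
import math

def kk_energy(pos, dist, K=1.0, L=1.0):
    """Kamada-Kawai spring energy of a layout (textbook form, an assumption).

    pos  -- list of (x, y) screen coordinates, one per node
    dist -- dist[i][j] is the graph-theoretic distance between nodes i and j
            (must be > 0 for i != j, i.e., the graph is assumed connected)
    """
    energy = 0.0
    n = len(pos)
    for i in range(n):
        for j in range(i + 1, n):      # every pair contributes, tied or not
            d = dist[i][j]
            k_ij = K / (d * d)         # springs weaken with graph distance
            l_ij = L * d               # desired screen distance for the pair
            gap = math.dist(pos[i], pos[j]) - l_ij
            energy += 0.5 * k_ij * gap * gap
    return energy
```

Because the sum runs over all pairs, moving a single node changes the energy terms of every other node, which is consistent with the superfluous local movement noted above.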

When asymmetric ties form a significant part of the relational story,
the peer-influence-based model seems to work well, though we find it very
sensitive to initial conditions and anchor points. When used with a constant
graph layout anchor, such as in the Newcomb example, movement
is often usefully limited to the most direct changes in graph structure.
When applied to our classrooms, for example, students tend not to cycle
around the initial circle, but remain fixed until they engage in a moving
interaction (see app. B, available in the online version of this article, for
examples).

The most obvious direction for extending network movies is to move
from an exploratory data analysis stage to a confirmatory analytic modeling
stage (McFarland and Bender-deMoll 2003). A good deal of headway
could be made, for example, by linking network movies to statistical
models of network change (Snijders 1998, 1990, 1996), or to models that
generate positional confidence intervals for nodes (Hoff et al. 2001). This
linkage would help us judge the degree of change in a network. This
linkage will be complicated, however, because much of the promise of
exploratory analysis is to identify elements from the data that are not
customarily included in global models. We suspect, however, that a closer
linkage between data and visualization will help us build better statistical
models.


APPENDIX A 
Introduction to SoNIA Software 


SoNIA (Social Network Image Animator) is a Java-based software package
for visualizing networks that change over time.¹⁵ SoNIA is being
developed as an open-source freeware project and is available with limited
documentation (see http://sonia.stanford.edu/). Work on SoNIA began
initially out of a desire for a set of visualization tools specifically tailored
for dealing with network data that include explicit time information and
push the limits of the more traditional single-matrix approach to network
data collection. Although time is dealt with to some degree in existing
network packages such as Pajek, it seemed useful to focus attention on
algorithms and techniques aimed at creating dynamic visualizations or
animated “movies” of network change. Also, implementations of the most
common layout algorithms rest on assumptions seldom discussed in the
network literature. SoNIA attempts to make some of these decisions more
explicit, provides some limited criteria for assessing the accuracy of a
given layout, and aims to produce layouts with a high degree of comparability
and replicability. In addition, SoNIA serves a useful function
as a “browser” for time-based network data, and its modular construction
allows it to serve as a test bed for the development and comparison of
network visualization algorithms.

¹⁵ SoNIA is currently under development by Dan McFarland and Skye Bender-deMoll.
The software, source files, installation instructions, and additional documentation are
available at http://sonia.stanford.edu/. Comments, suggestions, and programming
assistance would be greatly appreciated; please contact skyebend@stanford.edu.

SoNIA has the ability to read in a somewhat limited version of Pajek’s
arc-list-based “time network” format, and its own arc-list/spreadsheet style
format (.son files) in which tab-delimited columns control node and arc
attributes. Once data are loaded into SoNIA, the user has the ability to
choose a region of time to examine, the duration and offset of the “slices”
used to bin the data, and the layout algorithm to be employed. Currently
implemented are versions of Kamada-Kawai, Fruchterman-Reingold,
Moody’s Peer Influence, metric MDS, file coordinates, and circle. Plans
are to include GEM, Davidson-Harel, nonmetric MDS, and several other
multidimensional projection techniques. For each layout, SoNIA uses the
user's bin criteria to divide up arc events in regions of time and aggregate
them to form network matrices to be fed into the layout algorithm. For
most algorithms, these matrices are converted to dissimilarities and
symmetrized for use in distance computation. Several options are provided
for layout preprocessing (node starting positions, desired scaling, treatment
of isolates), postprocessing (rescaling, centering, etc.), as well as
parameters specific to each algorithm. For algorithms that use an optimization
technique, options are provided to control the “cooling schedule”
and stopping times, and feedback is provided about algorithm convergence.
Several algorithms (KK and FR) attempt to create layouts in which
screen distances are proportional to values in the network matrix. For
most real-world networks, it is impossible to create a zero-distortion
representation. However, the degree of distortion can be estimated using a
modified version of Kruskal’s stress statistic to compare the matrix and
screen distances. This value and a “Shepard’s plot” showing the distribution
of distortions can be displayed.
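
The slicing and distortion steps just described can be sketched in a few lines. This is an illustration under stated assumptions rather than SoNIA's implementation: the function names and the `(time, source, target, weight)` event tuple are hypothetical, the rule for converting tie weights to dissimilarities is omitted because it is not specified above, and the stress function is the standard Kruskal stress-1, not the modified version SoNIA reports.

```python
import math

def slice_matrices(events, n_nodes, start, end, duration, offset):
    """Aggregate timed arc events into one weight matrix per time slice.

    events   -- iterable of (time, source, target, weight) tuples (assumed format)
    duration -- width of each slice
    offset   -- distance between successive slice starts
                (offset < duration yields overlapping slices)
    """
    matrices = []
    t = start
    while t < end:
        m = [[0.0] * n_nodes for _ in range(n_nodes)]
        for time, i, j, w in events:
            if t <= time < t + duration:   # event falls inside this slice
                m[i][j] += w
        matrices.append(m)
        t += offset
    return matrices

def symmetrize(m):
    """Average m[i][j] and m[j][i] so the matrix can drive a distance-based layout."""
    n = len(m)
    return [[(m[i][j] + m[j][i]) / 2.0 for j in range(n)] for i in range(n)]

def stress(target, pos):
    """Standard Kruskal stress-1 between target distances and screen distances."""
    num = den = 0.0
    n = len(pos)
    for i in range(n):
        for j in range(i + 1, n):
            screen = math.dist(pos[i], pos[j])
            num += (target[i][j] - screen) ** 2
            den += screen ** 2
    return math.sqrt(num / den)
```

A stress of zero means the screen distances reproduce the target distances exactly; larger values indicate more distortion in the layout for that slice.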

The animation procedure for transitioning between layouts is similar
to “tweening,” in which a user-specified number of frames are displayed
showing gradual interpolation of nodes’ positions between successive layouts.
The transition takes place in “network time,” meaning that for
real-valued continuous data, node and arc additions and deletions between
the networks are shown in sequence as they occur. In addition, the time
coordinates for a “render slice” can be entered manually, allowing the user
to “browse” the data by displaying any arbitrarily sized region of time.
Node and arc attributes such as color, size, shape, labeling, and position
can be controlled from the input file (or to a limited degree in the program)
and are permitted to vary over time. SoNIA takes advantage of some of
Java’s 2D capabilities, which means graphics can be anti-aliased, and
arcs can have some degree of transparency (useful in situations where
multiplex ties exist). SoNIA has the ability to save QuickTime movie
files of the network animation, a log file describing the sequence of
operations and parameter settings used to create the layout, and a series of
“slice” matrices for analysis in other SNA programs.
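
The tweening step can be illustrated with a short sketch. It assumes straight-line (linear) interpolation between two successive layouts, which is one common reading of “gradual interpolation”; the function name `tween` and the dictionary representation of a layout are hypothetical and do not describe SoNIA's internals.

```python
def tween(start, end, n_frames):
    """Linearly interpolate node positions between two layouts.

    start, end -- dicts mapping node id -> (x, y); both must share the same keys
    n_frames   -- number of in-between frames (the two end layouts are excluded)
    """
    frames = []
    for f in range(1, n_frames + 1):
        a = f / (n_frames + 1)   # fraction of the way from start to end
        frames.append({
            node: (x0 + a * (end[node][0] - x0), y0 + a * (end[node][1] - y0))
            for node, (x0, y0) in start.items()
        })
    return frames
```

For example, `tween({"a": (0.0, 0.0)}, {"a": (4.0, 8.0)}, 3)` yields three frames in which node `a` sits one-quarter, one-half, and three-quarters of the way along the line between its two positions.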

Because SoNIA is coded in Java, it is cross-platform compatible, and
there are many possibilities for extending its capabilities and integrating
it with other packages. SoNIA’s internal structure and Java’s modular
object-oriented design make it relatively easy to add additional layout
techniques and explore modifications. In addition, source files are available
for inspection if the details of algorithm behavior are not clear from
documentation. However, a great deal of additional work is needed,
particularly in the area of extending the design to facilitate communication
with other open-source projects and network software.